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These lectures deal with our current knowledge of the matter distribution in the universe, fo- 
cusing on how this is studied via the large-scale structure seen in galaxy surveys. We first 
assemble the necessary basics needed to understand the development of density fiuctuations in 
an expanding universe, and discuss how galaxies are located within the dark-matter density field. 
Results from the 2dF Galaxy Redshift Survey are presented and contrasted with theoretical mod- 
els. We show that the combination of large-scale structure and data on microwave-background 
anisotropies can eliminate almost all degeneracies, and yield a completely specified cosmological 
model. This is the 'concordance' universe: a geometrically flat combination of vacuum energy 
and cold dark matter. The study of cosmic structure is able to establish this in a manner 
independent of external information, such as the Hubble diagram; this extra information can 
however be used to limit non-standard alternatives, such as a variable equation of state for the 
vacuum. 



1. Preamble 

1.1. The perturbed universe 

It has been clear since the 1930s that galaxies are not distributed at random in the 
universe (Hubble 1934). For decades, our understanding of this fact was limited by the 
lack of a three-dimensional picture, although some impressive progress was made: the 
dedication of pioneers such as Shane & Wirtanen in compiling galaxy catalogues by eye 
is humbling to consider. However, studies of the galaxy distribution came of age in the 
1980s, via redshift surveys, in which Hubble's v = Hd law is used to turn spectroscopic 
redshifts into estimates of distance (e.g. Davis & Peebles 1983; de Lapparant, Geller & 
Huchra 1986; Saunders et al. 1991). We were then able to see clearly (e.g. figure 1) a 
wealth of large-scale structures of size exceeding 100 Mpc. The existence of these cosmo- 
logical structures must be telling us something important about the initial conditions of 
the big bang, and about the physical processes that have operated subsequently. These 
lectures cover some of what we have learned in this regard. 

Throughout, it will be convenient to adopt a notation in which the density (of mass, 
light, or any property) is expressed in terms of a dimensionless density perturbation 5: 

l-t-5(x)^p(x)/(p>, (1) 

where {p) is the global mean density. The quantity 5 need not be small, but writing things 
in this form naturally suggests an approach via perturbation theory in the important 
linear case where 5^1. As we will see, this was a good approximation at early times. 
The existence of this field in the universe raises two questions: what generated it, and 
how does it evolve? A popular answer for the first question is inflation, in which quantum 
fluctuations are able to seed density fluctuations. So far, despite some claims, this theory 
is not tested, and we consider later some ways in which this might be accomplished. 
Mainly, however, we will be concerned here with the question of evolution. 



1 



2 



J. A. Peacock: Surveys and cosmic structure 



CfA: 0° < 5 < 30° 6*" v < 12 000 km s ' 




18'' 



Figure 1. One of the iconic pictures of the large-scale structure in the galaxy distribution is 
this slice made from John Huchra's ZCAT compilation of galaxy redshifts, reflecting the state of 
our knowledge in the mid-1980s. The survey coverage is not quite complete; as well as the holes 
due to the galactic plane around right ascensions 6*^ and 19*^, the rich clusters are somewhat 
over-represented with respect to a true random sampling of the galaxy population. Nevertheless, 
this plot emphasizes nicely both the large-scale features such as the 'great wall' on the left, the 
totally empty void regions, and the radial 'fingers of God' caused by virialized motions in the 
clusters. One of the principal challenges in cosmology is to explain this pattern. 



1.2. Relativistic viewpoint and gauge issues 

Many of the key aspects of the evolution of structure in the universe can be dealt with 

via a deceptively simple Newtonian approach, but honesty requires a brief overview of 
some of the difficult issues that will be evaded by taking this route. 

Because relativistic physics equations are written in a covariant form in which all 
quantities arc independent of coordinates, relativity docs not distinguish between active 
changes of coordinate (e.g. a Lorentz boost) or passive changes (a mathematical change of 
variable, normally termed a gauge transformation). This generality is a problem, since 
it is not trivial to know which coordinates should be used. To sec how the problems 
arise, ask how tensors of different order change under a gauge transformation x'^ —^ 
J.IH _ jj/i ^ g/i Consider first a scalar quantity S (which might be density, temperature 
etc.). A scalar quantity in relativity is normally taken to be independent of coordinate 
frame, but this is only for the case of Lorentz transformations, which do not involve a 
change of the spacetime origin. A gauge transformation therefore not only induces the 
usual transformation coefficients dx'^ jdx" , but also involves a translation that relabels 
spacetime points. We therefore have to deal with S'{x^ -|- e^) = S{x^), so the rule for 
the gauge transformation of scalars is 



5'(x^) ^S(x^)- e'^dS/dx''. 



(2) 
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Similar reasoning yields the gauge transformation laws for higher tensors, although we 
need to account not only for the translation of the origin, but also for the usual effect of 
the coordinate; transformation on the tensor. 

Consider applying this to the case of a uniform universe; here p only depends on time, 
so that 



An effective density perturbation is thus produced by a local alteration in the time 
coordinate: when we look at a universe with a fluctuating density, should we really 
think of a uniform model in which time is wrinkled? This ambiguity may seem absurd, 
and in the laboratory it could be resolved empirically by constructing the coordinate 
system directly - in principle by using light signals. This shows that the cosmological 
horizon plays an important role in this topic: perturbations with wavelength \ < ct 
inhabit a regime in which gauge ambiguities can be resolved directly via common sense. 
The real difficulties lie in the super-horizon modes with A > ct. However, at least 
within inflationary models, these difficulties can be overcome. According to inflation, 
perturbations on scales greater than the horizon were originally generated via quantum 
fluctuations on small scales within the horizon of a nearly de Sitter exponential expansion. 
There is thus no problem in understanding how the initial density field is described, since 
the simplest coordinate system can once again be constructed directly. 

The most direct way of solving these difficulties is to construct perturbation variables 
that are explicitly independent of gauge. Comprehensive technical discussions of this 
method arc given by Bardeen (1980), Kodama & Sasaki (1984), Mukhanov, Feldman 
& Brandcnbcrgcr (1992). The starting point for a discussion of metric perturbations 
is to devise a notation that will classify the possible perturbations. Since the metric is 
symmetric, there are 10 independent degrees of freedom in 5^"^; a convenient scheme that 
captures these possibilities is to write the cosmological metric as 



In this equation, rj is conformal time, and 7ij is the comoving spatial part of the 
Robertson- Walker metric. 

The total number of degrees of freedom here is apparently 2 (scalar fields cj) and ij}) -\- 
3 (3- vector field w) + 6 (symmetric 3-tensor hij) = 11. To get the right number, the 
tensor hij is required to be traceless: ^''h-ij = 0. The perturbations can be split into 
three classes: scalar perturbations, which are described by scalar functions of spacetime 
coordinate, and which correspond to the growing density perturbations studied above; 
vector perturbations, which correspond to vorticity perturbations, and tensor perturba- 
tions, which correspond to gravitational waves. Here, we shall concentrate mainly on 
scalar perturbations. Since vectors and tensors can be generated from derivatives of a 
scalar function, the most general scalar perturbation actually makes contributions to all 
the g^v components in the above expansion: 



where four scalar functions (f>, ip, E and B are involved. It turns out that this situation 
can be simplified by defining variables that are unchanged by a gauge transformation: 



/ • 



(3) 



dT^ = 



a^it]) {{1 + 2(j))drf + 2widr]dx' - [(1 - 2ij;)-iij + 2/iy] dx^ dx^] . 



(4) 




(5) 




(6) 
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where primes denote derivatives with respect to conformal time. 

These gauge-invariant 'potentials' have a fairly direct physical interpretation, since 
they are closely related to the Newtonian potential. The easiest way to evaluate the 
gauge-invariant fields is to make a specific gauge choice and work with the longitudinal 
gauge in which E and B vanish, so that <b = cj) and = ^/i. A second key result is 
that inserting the longitudinal metric into the Einstein equations shows that (j) and are 
identical in the case of fluid-like perturbations where off-diagonal elements of the energy- 
momentum tensor vanish. In this case, the longitudinal gauge becomes identical to the 
Newtonian gauge, in which perturbations are described by a single scalar field, which is 
the gravitational potential. The conclusion is thus that the gravitational potential can for 
many purposes give an effectively gauge-invariant measure of cosmological perturbations, 
and this provides a sounder justification for the Newtonian approach that we now adopt. 



2. Newtonian equations of motion 

2.1. Matter- dominated universe 

In the Newtonian approach, we treat dynamics of cosmological matter exactly as we 
would in the laboratory, by finding the equations of motion induced by either pressure 
or gravity. In what follows, it should be remembered that we probably need to deal 
in practice with two rather different kinds of material: dark matter that is coUisionless 
and interacts only via gravity, and baryonic material which is a collisional fiuid, coupled 
to dark matter only via gravity (and to photons via Thomson scattering, so that the 
dominant part of the pressure derives from the radiation). 

Also, the problem of cosmological dynamics has to deal with the characteristic feature 
of the Hubble expansion. This means that it is convenient to introduce comoving length 
units, and to consider primarily peculiar velocities - i.e. deviations from the Hubble flow. 
The standard notation that includes these aspects is 

x(t) = a{t)v{t) 

6v{t) = o{t)u{t), ^ ' 

so that X has units of proper length, i.e. it is an Eulerian coordinate. First note that the 
comoving peculiar velocity u is just the time derivative of the comoving coordinate r: 

X = dr + ar = Hx + ar, (8) 

where the rhs must be equal to the Hubble flow Hx, plus the peculiar velocity = au. 

In this equation, dots stand for exact convectivc time derivatives - i.e. time derivatives 
measured by an observer who follows a particle's trajectory - rather than partial time 
derivatives d/dt. 

The equation of motion follows from writing the Eulerian equation of motion as x = 
go +g) where g = — V$/a is the peculiar acceleration, and go is the acceleration that 
acts on a particle in a homogeneous universe (neglecting pressure forces to start with, 
for simplicity). Differentiating x = ar twice gives 

.. . „. a 

X = au -I- 2au -I- - x = go + g. (9) 

The unperturbed equation corresponds to zero peculiar velocity and zero peculiar accel- 
eration: (o/a) X = go; subtracting this gives the perturbed equation of motion 

u + 2(d/a)u = g/a. (10) 

The only point that needs a little more thought is the nature of the unperturbed equation 
of motion. This cannot be derived from Newtonian gravity alone, since general relativity 
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is really needed for a proper derivation of the homogeneous equation of motion. However, 
as long as we are happy to accept that go is given, then it is a well-defined procedure to 
add a peculiar acceleration that is the gradient of the potential derived from the density 
perturbations. 

The equation of motion for the peculiar velocity shows that u is affected by grav- 
itational acceleration and by the Hubble drag term, 2(d/a)u. This arises because the 
peculiar velocity falls with time as a particle attempts to catch up with successively more 
distant (and therefore more rapidly receding) neighbours. If the proper peculiar velocity 
is V, then after time dt the galaxy will have moved a proper distance x = vdt from 
its original location. Its near neighbours will now be galaxies with recessional velocities 
H X = Hvdt, relative to which the peculiar velocity will have fallen to v — Hx. The 
equation of motion is therefore just 

v = —Hv = ——v, (11) 

with the solution v oc a~^: peculiar velocities of nonrelativistic objects suflfer redshifting 
by exactly the same factor as photon momenta. This becomes ii = —2Hu when rewritten 
in comoving units. 

The peculiar velocity is directly related to the evolution of the density field, through 
conservation of mass. This is expressed via the continuity equation, which takes the form 

j^Po{l + S) = -po{l + S)V-u. (12) 
Here, spatial derivatives are with respect to comoving coordinates: 

V = a Vproper, (13) 

which we will assume hereafter, and the time derivative is once more a convective one: 

^ = #+u.V. (14) 
dt dt ^ ^ 

Finally, when using comoving length units, the background density po independent of 

time, and so the full continuity equation can be written as 

±S = -{l + 6)V-u. (15) 

Unlike the equation of motion for u, this is not linear in the perturbations S and u. To 

cure this, we restrict ourselves to the case S ^ 1 and linearize the equation, neglecting 
second-order terms like (5 x u, which removes the distinction between convective and 
partial time derivatives. The linearized equations for conservation of momentum and 
matter as experienced by fundamental observers moving with the Hubble flow are then: 

u + 2-u=^ , ^ 

a a (16) 

^ = -V-u, 

where the peculiar gravitational acceleration — V^/a is denoted by g. 

The solutions of these equations can be decomposed into modes either parallel to g 
or independent of g (these are the homogeneous and inhomogeneous solutions to the 
equation of motion). The homogeneous case corresponds to no peculiar gravity ~ i.e. 
zero density perturbation. This is consistent with the linearized continuity equation, 
V • u = —S, which says that it is possible to have vorticity modes with V • u = for 
which S vanishes, so there is no growth of structure in this case. The proper velocities of 
these vorticity modes decay as v = au oc , as with the kinematic analysis for a single 
particle. 
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Growing mode For the growing mode, it is most convenient to eliminate u by taking 
the divergence of the equation of motion for u, and the time derivative of the continuity 
equation. This requires a knowledge of V- g, which comes via Poisson's equation: V • g = 
AnGapoS. The resulting 2nd-ordcr equation for 5 is 

6 + 2^6 = 47rGpo S. (17) 

This is easily solved for the Clm = 1 case, where iwGpo = 3H'^/2 = 2/3t^, and a power- 
law solution works: 

5(t)oct2/3 or t-K (18) 

The first solution, with d{t) oc a{t) is the growing mode, corresponding to the gravita- 
tional instability of density perturbations. Given some small initial seed fluctuations, 
this is the simplest way of creating a universe with any desired degree of inhomogeneity. 

An alternative way of looking at the growing mode is that we want to try looking for 
a homogeneous solution u = F(t)g. Then using continuity plus V • g = 4:nGapoS, gives 
us 

where the function /(ilm) = din 6/ din a. A very good approximation to this (Peebles 
1980) is / ~ (a result that is almost independent of A; Lahav et al. 1991). 

Jeans scale So far, we have mainly considered the coUisionlcss component. For the 
photon-baryon gas, all that changes is that the peculiar acceleration gains a term from 
the pressure gradients: 

g = -V$/a- Vp/(apo). (20) 
The pressure fluctuations are related to the density perturbations via the sound speed 

- (21) 

dp 

Now think of a plane-wave disturbance S oc e~^^'^ , where k is a comoving wavevector; 
in other words, suppose that the wavelength of a single Fourier mode stretches with the 
univ(^rsc. All time dependence is carried by the amplitude of the wave, and so the spatial 
dependence can be factored out of time derivatives in the above equations (which would 
not be true with a constant comoving wavenumber k/a). The equation of motion for 5 
then gains an extra term on the rhs from the pressure gradient: 

5 + 2'^5 = 5{ATTGpo-clk'^la^). (22) 

This shows that there is a critical proper wavelength, the Jeans length, at which we switch 
from the possibility of gravity-driven growth for long- wavelength modes to standing sound 
waves at short wavelengths. This critical length is 

A. = 1^ = c../^. (23) 
kja \ Gp 

Qualitatively, we expect to have no growth when the 'driving term' on the rhs is negative. 
However, owing to the expansion, Aj will change with time, and so a given perturbation 
may switch between periods of growth and stasis. These effects help to govern the form 
of the perturbation spectrum that propagates to the present universe from early times. 



The general case How does the matter-dominated growth S{a) oc a change at late 
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times when 7^ 1? The differential equation for 6 is as before, but a{t) is altered. 
Provided the vacuum equation of state is exactly p = —pc?, or if the vacuum energy is 
negligible, the solutions to the growth equations can be written as 

(5 oc I "^"/"^ /o"(^)"^ (growing mode) ^24) 

[ (a/a) (decaying mode). 

(Heath 1977; see also section 10 of Peebles 1980). For the most general case, e.g. a 
vacuum with time-varying density, these do not apply, and the differential equation for 
5 must be integrated directly. 

In any case, the equation for the growing mode requires numerical integration unless 
the vacuum energy vanishes. A very good approximation to the answer is given by Carroll 
et al. (1992): 

8'}ZT=i) ^ K - ^ ^ ^ " • ^''^ 

This fitting formula for the growth suppression in low-density universes is an invaluable 
practical tool. For flat models with + = 1) it says that the growth suppression 
is less marked than for an open universe - approximately as against Q^-^^ in the 

A = case. This reflects the more rapid variation of f2„ with redshift; if the cosmological 
constant is important dynamically, this only became so very recently, and the universe 
spent more of its history in a nearly Einstein-de Sitter state by comparison with an open 
universe of the same Sim- 

2.2. Radiation- dominated universe 

At early enough times, the universe was radiation dominated {cg = c/\/3) and the 
analysis so far does not apply. It is common to resort to general relativity perturbation 
theory at this point. However, the fields are still weak, and so it is possible to generate 
the results we need by using special relativity fluid mechanics and Newtonian gravity 
with a relativistic source term: 

V^* = 4ttG{p + Sp/c^), (26) 

in Eulerian units. The main change from the previous treatment come from factors of 
2 and 4/3 due to this {p + Sp/c"^) term, and other contributions of the pressure to the 
relativistic equation of motion. The resulting evolution equation for S is 

5 + 2U='^Gpo5, (27) 

so the net result of all the relativistic corrections is a driving term on the rhs that is a 
factor 8/3 higher than in the matter-dominated case (see e.g. Section 15.2 of Peacock 
1999 for the details). 

In both matter- and radiation-dominated universes with O = 1, we have po oc l/t^: 

2 

matter domination (o oc t^^^) : AnGpo = — ^ 

3* ^ (28) 
radiation domination (a oc t^^"^) : 327rGpo/3 = -tt- 

Every term in the equation for 5 is thus the product of derivatives of 5 and powers of 
t, and a power-law solution is obviously possible. If we try 6 oc f", then the result is 
n = 2/3 or —1 for matter domination; for radiation domination, this becomes n = ±1. 
For the growing mode, these can be combined rather conveniently using the conformal 
time T] = J dt/a: 

6 oc rf. (29) 
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The quantity 77 is proportional to the comoving size of the cosmological particle horizon. 

One further way of stating this result is that gravitational potential perturbations are 
independent of time (at least while Vl = 1). Poisson's equation tells us that — /c^<f)/a^ cx 
p5] since p (X for matter domination or for radiation, that gives $ cx (5/a or 
S/a^ respectively, so that $ is independent of a in either case. In other words, the metric 
fluctuations resulting from potential perturbations are frozen, at least for perturbations 
with wavelengths greater than the horizon size. 

2.3. Meszdros effect 

What about the case of collisionless matter in a radiation background? The fluid treat- 
ment is not appropriate here, since the two species of particles can interpenetrate. A 
particularly interesting limit is for perturbations well inside the horizon: the radiation 
can then be treated as a smooth, unclustcrcd background that affects only the overall 
expansion rate. This is analogous to the effect of A, but an analytical solution does exist 
in this case. The perturbation equation is as before 

5 + 2^6 = 'InOpmS, (30) 

but now H"^ = 8TrG{pm + pr)/'^- If we change variable to y = pm/Pr = a/ueq, and use 
the Friedmann equation, then the growth equation becomes 

(for A; = 0, as appropriate for early times). It may be seen by inspection that a growing 
solution exists with 5" = 0: 

(5 ay + 2/3. (32) 

It is also possible to derive the decaying mode. This is simple in the radiation-dominated 
case (2/ <C 1): (5 oc — Iny is easily seen to be an approximate solution in this limit. 

What this says is that, at early times, the dominant energy of radiation drives the 
universe to expand so fast that the matter has no time to respond, and S is frozen 
at a constant value. At late times, the radiation becomes negligible, and the growth 
increases smoothly to the Einstein-de Sitter 5 oc a behaviour (Meszaros 1974). The 
overall behaviour is therefore similar to the effects of pressure on a coupled fluid: for scales 
greater than the horizon, perturbations in matter and radiation can grow together, but 
this growth ceases once the perturbations enter the horizon. However, the explanations 
of these two phenomena arc completely different. In the fluid case, the radiation pressure 
prevents the perturbations from collapsing further; in the collisionless case, the photons 
have free-streamed away, and the matter perturbation fails to collapse only because 
radiation domination ensures that the universe expands too quickly for the matter to have 
time to self-gravitate. Because matter perturbations enter the horizon (at y = gentry) 
with (5 > 0, 5 is not frozen quite at the horizon-entry value, and continues to grow until 
this initial 'velocity' is rcdshiftcd away, giving a total boost factor of roughly Inr/cniry 
This log factor may be seen below in the fltting formulae for the CDM power spectrum. 

2.4. Coupled perturbations 

We will often be concerned with the evolution of perturbations in a universe that contains 
several distinct components (radiation, baryons, dark matter). It is easy to treat such a 
mixture if only gravity is important (i.e. for large wavelengths). Look at the perturbation 
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equation in the form 

q2 2a d 

L6 = driving term, dt^ ~^di' ^^^^ 

The rhs represents the effects of gravity, and particles will respond to gravity whatever 
its source. The coupled equations for several species are thus given by summing the 
driving terms for all species. 



Matter plus radiation The only subtlety is that wc must take into account the 
peculiarity that radiation and pressureless matter respond to gravity in different ways, 
as seen in the equations of fluid mechanics. The coupled equations for perturbation 
growth are thus 

Solutions to this will be simple if the matrix has time-independent eigenvectors. Only 
one of these is in fact time independent: (1,4/3). This is the adiabatic mode in which 
Sr = 45^/3 at all times. This corresponds to some initial disturbance in which matter 
particles and photons are compressed together. The entropy per baryon is unchanged, 
6{T^)/{T^) = 6m, hence the name 'adiabatic'. In this case, the perturbation amplitude 
for both species obeys L5 = 4:TrG{pm + 8pr/3)6. We also expect the baryons and photons 
to obey this adiabatic relation very closely even on small scales: the tight coupling 
approximation says that Thomson scattering is very effective at suppressing motion of 
the photon and baryon fluids relative to each other. 



Isocurvature modes The other perturbation mode is harder to see until we realize 

that, whatever initial conditions wc choose for Sr and Sm, any subsequent changes to 
matter and radiation on large scales must be adiabatic (only gravity is acting). Suppose 
that the radiation field is initially chosen to be uniform; we then have 

4 

= ^{Sm- Si), (35) 



where Si is some initial value of 6m 

327rG 

J^Om = = 



The equation for Sm becomes 

m PrSi 



3 

Pr + oP" 



(36) 



which is as before if Si = 0. The other solution is therefore a particular integral with 
S (X Si. For fl = 1, the answer can be expressed most neatly in terms of y = Pm/ Pr 
(Peebles 1987): 

y y^ (37) 

Sr/S, = A(Sm/S, - l)/3 ~ -2?y/3 + • • • 

At late times, Sm 0, while Sr — 4(5j/3. This mode is called the isocurvature mode, 
since it corresponds to a total density perturbation Sp/p — > as — > 0. Subsequent evo- 
lution attempts to preserve constant density by making the matter perturbations decrease 
while the amplitude of Sr increases. An alternative name for this mode is an entropy 
perturbation. This reflects the fact that one only perturbs the initial ratio of photon and 
matter number densities. The late-time evolution is then easily understood: causality re- 
quires that, on large scales, the initial entropy perturbation is not altered. Hence, as the 
universe becomes strongly matter dominated, the entropy perturbation becomes carried 
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entirely by the photons. This leads to an increased amplitude of microwave-background 
anisotropies in isocurvature models (Efstathiou & Bond 1986), which is one reason why 
such models arc not popular. Of course, a small admixture of isocurvature perturbations 
is always going to be hard to rule out (e.g. Bucher, Moodley & Turok 2002), so neglect of 
this mode is primarily justified by the fact that the simplest model for the generation of 
cosmological perturbations (single-field inflation) produces pure adiabatic modes. Mod- 
els with multiple fields, such as the decaying curvaton of Lyth & Wands (2002) tend to 
generate order-unity isocurvature contributions, which are impossible to reconcile with 
CMB data (e.g. Gordon & Lewis 2002). 



(38) 



Baryons and dark matter This case is simpler, because both components have the 
same equation of state: 

Both eigenvectors are time independent: (1, 1) and (fid, —ilb)- The time dependence of 
these modes is easy to see for an il = 1 matter-dominated universe: if we try (5 oc f", then 
we obtain respectively n = 2/3or— 1 and n = or —1/3 for the two modes. Hence, if we 
set up a perturbation with Sb = 0, this mixture of the eigenstates will quickly evolve to 
be dominated by the fastest-growing mode with Si, = Sd,: the baryonic matter falls into 
the dark potential wells. This is one process that allows universes containing dark matter 
to produce smaller anisotropies in the microwave background: radiation drag allows the 
dark matter to undergo growth between matter-radiation equality and recombination, 
while the baryons are held back. 

This is the solution on large scales, where pressure effects are negligible. On small 
scales, the effect of pressure will prevent the baryons from continuing to follow the dark 
matter. Wc can analyse this by writing down the coupled equation for the baryons, but 
now adding in the pressure term (sticking to the matter-dominated era, to keep things 
simple) : 

L5b = L5d-k'^c%/a^. (39) 

In the limit that dark matter dominates the gravity, the first term on the rhs can be 

taken as an imposed driving term, of order 5d/f^. In the absence of pressure, we saw 
that 5b and 5d grow together, in which case the second term on the rhs is smaller than 
the first if kcst/a <C 1. Conversely, for large wavenumbers {kcst/a 1), baryon pressure 
causes the growth rates in the baryons and dark matter to differ; the main behaviour 
of the baryons will then be slowly declining sound waves, and we can write the WKB 
solution. 



Sb oc (acs) ^^"^ exp 



zti / fccs drj 



(40) 



where rj is conformal time. An alternative way to see that the baryons are damped is to 
write the coupled equations as 

L(f] = '-^(\,^' (41) 



(5d / fl \ Oj, Qd J \Sd 
where k = k/kj. In the special case fit — > and k = constant, a solution is clearly 

and this is found to be the asymptotic solution in more general cases (Nusser 2000). 
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This oscillatory behaviour holds so long as pressure forces continue to be important. 
However, the sound speed drops by a large factor at recombination, and we would then 
expect the oscillatory mode to match on to a mixture of the pressure-free growing and 
decaying modes. This behaviour can be illustrated in a simple model where the sound 
speed is constant until recombination at conformal time r]r and then instantly drops to 
zero. The behaviour of the density field before and after tr may be written as 

j So sin (wr?)/ (wry) {rj < r]r) , .„^ 

where w = kcs- Matching 6 and its time derivative on either side of the transition allows 
the decaying component to be eliminated, giving the following relation between the 
growing-mode amplitude after the transition to the amplitude of the initial oscillation: 

Ar]l = y cosa;??^. (44) 

The amplitude of the growing mode after recombination depends on the phase of the 
oscillation at the time of recombination. The output is maximised when the input den- 
sity perturbation is zero and the wave consists of a pure velocity perturbation; this 
effect is known as velocity overshoot. The post-recombination transfer function will thus 
display oscillatory features, peaking for wavenumbers that had particularly small am- 
plitudes prior to recombination. Such effects can be seen at work in determining the 
relative positions of small-scale features in the power spectra of matter fluctuations and 
microwave-background fluctuations. 

2.5. Transfer functions and characteristic scales 

The transfer function for models with the full above list of ingredients was first computed 
accurately by Bond & Szalay (1983), and is today routinely available via public-domain 
codes such as CMBFAST (Seljak & Zaldarriaga 1996). These calculations are a techni- 
cal challenge because we have a mixture of matter (both collisionless dark particles and 
baryonic plasma) and relativistic particles (collisionless neutrinos and collisional pho- 
tons), which does not behave as a simple fluid. Particular problems are caused by the 
change in the photon component from being a fluid tightly coupled to the baryons by 
Thomson scattering, to being collisionless after recombination. Accurate results require 
a solution of the Boltzmann equation to follow the evolution in detail. 

Some illustrative results are shown in figure 2. Leaving aside the isocurvature models, 
all adiabatic cases have T — > 1 on large scales - i.e. there is growth at the universal 
rate (which is such that the amplitude of potential perturbations is constant until the 
vacuum starts to be important at z < 1). The different shapes of the functions can be 
understood intuitively in terms of a few special length scales, as follows: 

(1) Horizon length at matter- radiation equality. The main bend visible in 
all transfer functions is due to the Meszaros effect, which arises because the universe 
is radiation dominated at early times. Fluctuations in the matter can only grow if 
dark matter and radiation fall together. This does not happen for perturbations of 
small wavelength, because photons and matter can separate. Growth only occurs for 
perturbations of wavelength larger than the horizon distance, where there has been no 
time for the matter and radiation to separate. The relative diminution in fluctuations 
at high k is the amount of growth missed out on between horizon entry and Zeq, which 
would be ^ oc D| in the absence of the Meszaros effect. Perturbations with larger k 
enter the horizon when _Dh — 1/fc; they are then frozen until Zoq, at which point they 
can grow again. The missing growth factor is just the square of the change in during 
this period, which is oc k^. The approximate limits of the CDM transfer function are 
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Figure 2. A plot of transfer functions for various adiabatic models, in which Tfc — > 1 at small 
k. A number of possible matter contents are illustrated: pure baryons; pure CDM; pure HDM. 
For dark-matter models, the characteristic wavermmbcr scales proportional to Qmh^, marking 
the break scale corresponding to the horizon length at matter-radiation equality. The scaling for 
baryonic models does not obey this exactly; the plotted case corresponds to = 1, h = 0.5. 



therefore 



fc£>H(-2eq) > 1- 



(45) 



This process continues, until z^q = 23 900 ri„i/i^ , where the universe becomes matter 
dominated. We therefore expect a characteristic 'break' in the fluctuation spectrum 
around tlie comoving horizon length at this time. Using a distance-redshift relation that 
ignores vacuum energy at high z, 

dz 



Rodr = 



Ho (1 + z)^l + nmZ+{l + zyflr' 



we obtain 



2c 



-Wo 



/2 



(46) 



(47) 



Since distances in cosmology always scale as h~^, this means that O^/i should be ob- 
servable. 

(2) Free-streaming length. This relatively gentle filtering away of the initial fluctu- 
ations is all that applies to a universe dominated by Cold Dark Matter, in which random 
velocities are negligible. A CDM universe thus contains fluctuations in the dark matter 
on all scales, and structure formation proceeds via hierarchical process in which nonlinear 
structures grow via mergers. 

Examples of CDM would be thermal relic WIMPs with masses of order 100 GeV. Relic 
particles that were never in equilibrium, such as axions, also come under this heading, as 
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do more exotic possibilities such as primordial black holes. A more interesting case arises 
when thermal relics have lower masses. For collisionless dark matter, perturbations can 
be erased simply by free streaming: random particle velocities cause blobs to disperse. 
At early times {kT > mc?'), the particles will travel at c, and so any perturbation that 
has entered the horizon will be damped. This process switches off when the particles 
become non-relativistic, so that perturbations are erased up to proper lengthscales of 
ct{kT = m(?). This translates to a comoving horizon scale {2ct/a during the radiation 
era) at kT = mc? of 

-^frce— stream — 

112(m/eV)~iMpc (48) 

(in detail, the appropriate figure for neutrinos will be smaller by (4/11)^/^ since they 
have a smaller temperature than the photons). A light neutrino-like relic that decouples 
while it is relativistic satisfies 

Vl^h? =m/92,.^cN (49) 

Thus, the damping scale for HDM (Hot Dark Matter) is of order the bend scale. Alter- 
natively, if the particle decouples sufficiently early, its relative number density is boosted 
by annihilations, so that the critical particle mass to make ri„i = 1 can be boosted to 
around 1-10 keV (Warm Dark Matter). The existence of galaxies at 2; ~ 6 tells us that 
the coherence scale must have been below about 100 kpc, so WDM is close to being ruled 
out. A similar constraint is obtained from small-scale structure in the Lyman-alpha forest 
(Narayanan et al. 2000): m > 0.75 kcV. 

A more interesting (and probably practically relevant) case is when the dark matter is 
a mixture of hot and cold components. The free-streaming length for the hot component 
can therefore be very large, but within range of observations. The dispersal of HDM 
fluctuations reduces the CDM growth rate on all scales below Lfree-stream - or, relative 
to small scales, there is an enhancement in large-scale power. 

(3) Acoustic horizon length. The horizon at matter-radiation equality also enters 
in the properties of the baryon component. Since the sound speed is of order c, the largest 
scales that can undergo a single acoustic oscillation are of order the horizon. The transfer 
function for a pure baryon universe shows large modulations, reflecting the number of 
oscillations that have been completed before the universe becomes matter dominated and 
the pressiire support drops. The lack of such large modulations in real data is one of 
the most generic reasons for believing in collisionless dark matter. Acoustic oscillations 
persist even when baryons are subdominant, however, and can be detectable as lower- 
level modulations in the transfer function (e.g. Goldberg & Strauss 1998; Meiksin et al. 
1999). 

(4) Silk damping length. Acoustic oscillations are also damped on small scales, 

where the process is called Silk damping: the mean free path of photons due to scattering 
by the plasma is non-zero, and so radiation can diffuse out of a perturbation, convecting 
the plasma with it. The typical distance of a random walk in terms of the diffusion 
coefScient is a; ~ V^Dt, which gives a damping length of 

As ^ ^/>^, (50) 

the geometric mean of the horizon size and the mean free path. Since A = l/(na-Y) = 
44.3(1 + z)~^ {ilbh^)~^ proper Gpc, we obtain a comoving damping length of 

As = 16.3 (1 + zy^/^{nln^h^)-^/^ Gpc. (51) 

This becomes close to the horizon length by the time of last scattering, 1 -f- 2; ~ 1100. 
The resulting damping effect can be seen in figure 2 at fc ^ lOfcn- 
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Fitting formulae It is invaluable in practice to have some accurate analytic formulae 
that fit the numerical results for transfer functions. We give below results for some 
common models of particular interest (illustrated in figure 2, along with other cases 
where a fitting formula is impractical). For the models with collisionless dark matter, 
<C is assumed, so that all lengths scale with the horizon size at matter-radiation 
equality, leading to the definition 

q= r. (52) 

We consider the following cases: (1) Adiabatic CDM; (2) Adiabatic massive neutrinos 
(1 massive, 2 massless); (3) Isocurvature CDM; these expressions come from Bardeen et 
al. (1986; BBKS). Since the characteristic length-scale in the transfer fimction depends 
on the horizon size at matter-radiation equality, the temperature of the CMB enters. In 
the above formulae, it is assumed to be exactly 2.7 K; for other values, the characteristic 
wavenumbcrs scale oc T"^. For these purposes massless neutrinos count as radiation, 
and three species of these contribute a total density that is 0.68 that of the photons. 

(1) n = ^"^^+3^^^^'^ [1 + 3.89g + {IQ.lqf + {hAQqf + [Q.nqf] 

(2) Tfe = exp(-3.99-2.1g2) (53) 

/ r , Tl.24\ -1/1-24 

(3) Tfc = (5.65)M 1+ ^15.0g+(0.9q)3/2 + (5.6g)2j J 

The case of mixed dark matter (MDM: a mixture of massive neutrinos and CDM) is 
more complex. Ma (1996) gives the following expression: 



T 

MDM 



1 + (0.081a;)i-630 + (0.040a:)3-259i f^i °V2 



(54) 



1 + (2.080x0)3-259 

where x = k/T^, T,^ = a^^'^fl^h^ and xq is the value of x at a = 1. The scale-factor 

dependence is such that the damping from neutrino free-streaming is less severe at high 
redshift, but the spectrum is very nearly of constant shape for z < 10. See Pogosyan & 
Starobinsky (1995) for a more complicated fit of higher accuracy. 

These expressions are useful for work at a level of 10% precision, but increasingly 
it is necessary to do better. In particular, these expressions do not include the weak 
oscillatory features that are expected if the universe has a significant baryon content. 
Eisenstein & Hu (1998) give an accurate (but long) fitting formula that describes these 
wiggles for the CDM transfer function. This was extended to cover MDM in Eisenstein 
& Hu (1999). 



3. Nonlinear evolution of cosmic structure 

The equations of motion are nonlinear, and we have only solved them in the limit of 
linear perturbations. We now discuss evolution beyond the linear regime, first for a few 
special density models, and then considering full numerical solution of the equations of 
motion. 

3.1. The Zeldovich approximation 

Zeldovich (1970) invented a kinematical approach to the formation of structure. In this 
method, we work out the initial displacement of particles and assume that they continue 
to move in this initial direction. Thus, we write for the proper coordinate of a given 
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particle 

x(t) =a(t)q + 6(t)f(q). (55) 

This looks like Hubble expansion with some perturbation, which will become negligible 
as t ^ 0. The coordinates q are therefore equal to the usual comoving coordinates at 
t = 0, and b{t) is a function that scales the time- independent displacement field f(q). In 
fluid-mechanical terminology, x is said to be the Eulerian position, and q the Lagrangian 
position. 

To get the; Eulerian density, we need the Jacobian of the transformation between x 
and q, in which frame p is constant. This strain tensor is symmetric, provided we assume 
that the density perturbation originated from a growing mode. The displacement fleld 
is then irrotational, so that we can write it in terms of a potential 

f(q)=VV^(q) ^ 1^ = /^- (56) 

The strain tensor is thus characterized by its three eigenvalues, and the density becomes 
infinite when the most negative eigenvalue reaches —1. 

If we linearize the density relation, then the relation to density perturbations is 

5 = -^V-f. (57) 

This is first-order Lagrangian perturbation theory, in contrast to the earlier approach, 
which carried out perturbation theory in Eulerian space (higher-order Lagrangian theory 
is discussed by Bouchet et al. 1995). When the density fluctuations are small, a first- 
order treatment from either point of view should give the same result. Since the linearized 
density relation is (5 = — (6/a)V • f, wc can tell immediately that [b{t)/a{t)] = D{t), where 
D[t) is the linear density growth law. Without doing any more work, we therefore know 
that the first-order form of Lagrangian perturbations must be 

x(t)=a(i)[q + £>(t)f(q)], (58) 

so that b{t) = a{t)D{t). The advantage of the Zeldovich approximation is that it normally 
breaks down later than Eulerian linear theory - i.e. first-order Lagrangian perturbation 
theory can give results comparable in accuracy to Eulerian theory with higher-order terms 
included. This method is therefore commonly used to set up quasi-linear initial conditions 
for A'^-body simulations, as discussed below. The same arguments that we used earlier in 
discussing peculiar velocities show that the growing-mode comoving displacement field f 
is parallel to k for a given Fourier mode, so that 

ffc = -i ^ k. (59) 

Given the desired linear density mode amplitudes, the corresponding displacement field 
can then be constructed. 



3.2. The spherical model 

An overdense sphere is a very useful nonlinear model, as it behaves in exactly the same 
way as a closed sub-universe. The density perturbation need not be a uniform sphere: 
any spherically symmetric perturbation will clearly evolve at a given radius in the same 
way as a uniform sphere containing the same amount of mass. In what follows, therefore, 
density refers to the mean density inside a given sphere. The equations of motion are 
the same as for the scale factor, and we can therefore write down the cycloid solution 
immediately. For a matter-dominated universe, the relation between the proper radius 
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of the sphere and time is 

. i (60 

t = 5(6' -sin 6*), ^ ^ 

and A'^ — GMB^, just from r = —GM/r"^. Expanding these relations up to order 9^ 
gives r{t) for smaU t: 

2/3" 



^~^\b) ~ 20 
and we can identify the density perturbation within the sphere: 

3 /6t^^/^ 



(61) 



This ah agrees with what we knew already: at early times the sphere expands with the 
a oc t^/^ Hubble flow and density perturbations grow proportional to a. 

Wc can now sec how linear theory breaks down as the perturbation evolves. There 
are three interesting epochs in the final stages of its development, which we can read 
directly from the above solutions. Here, to keep things simple, we compare only with 
linear theory for an O = 1 background. 

(1) Turnround. The sphere breaks away from the general expansion and reaches a 
maximum radius at = tt, i = ttB. At this point, the true density enhancement 
with respect to the background is just [A{Qt/ Bfl^ /2f /r^ = QTr^/ie ~ 5.55. By 
comparison, extrapolation of linear 5 oc t^/^ theory predicts 5ii„ = (3/20) (67r)^/^ ~ 
1.06. 

(2) Collapse. If only gravity operates, then the sphere will collapse to a singularity 
at 6* = 27r. This occurs when 6^^ = (3/20)(127r)2/3 ^ ^ 59 

(3) Virialization. Clearly, collapse will never occur in practice; dissipative physics 
will eventually intervene and convert the kinetic energy of collapse into random 
motions. How dense will the resulting body be? Consider the time at which the 
sphere has collapsed by a factor 2 from maximum expansion. At this point, it has 
kinetic energy K related to potential energy V hyV = ~2K . This is the condition 
for equilibrium, according to the virial theorem. For this reason, many workers 
take this epoch as indicating the sort of density contrast to be expected as the 
endpoint of gravitational collapse. This occurs s.t 6 = 37r/2, and the corresponding 
density enhancement is (97r + 6)^/8 ~ 147, with (Jun ~ 1.58. Some authors prefer 
to assume that this virialized size is eventually achieved only at collapse, in which 
case the contrast becomes (67r)^/2 ~ 178. 

These calculations are the basis for a common 'rule of thumb', whereby one assumes that 
linear theory applies until (5iin is equal to some 5c a little greater than unity, at which point 
virialization is deemed to have occurred. Although the above only applies for = 1, 
analogous results can be worked out from the full (5iin(z, fi) and f(z, O) relations; (Jun — 1 
is a good criterion for collapse for any value of f2 likely to be of practical relevance. The 
full density contrast at virialization may be approximated by 

l + 5^i, ~178fi-°-^(ivir) (63) 

(although open models show a slightly stronger dependence on f7„j than flat A-dominated 
models; Eke et al. 1996). The faster expansion of low-density universes means that, by 
the time a perturbation has turned round and collapsed to its final radius, a larger 
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density contrast has been produced. For real non-spherical systems, it is not clear that 
this distinction is meaningful, and in practice a density contrast of around 200 is used to 
define the virial radius that marks the boundary of an object. 

3.3. N-body models 

The exact evolution of the density field is usually performed by means of an N-body 
simulation, in which the density field is represented by the sum of a set of fictitious 
discrete particles. The equations of motion for each particle depend on solving for the 
gravitational field due to all the other particles, finding the change in particle positions 
and velocities over some small time step, moving and accelerating the particles, and 
finally re-calculating the gravitational field to start a new iteration. Using comoving 
units for length and velocity (v = au), wc have previously seen the equation of motion 

^u = -2^u-lv$, (64) 
dt a a'' 

where $ is the Newtonian gravitational potential due to density perturbations. The time 

derivative is already in the required form of the convectivc time derivative observed by 
a particle, rather than the partial d/dt. If we change time variable from t to a, this 
becomes 

d , 2 \ o, G Xi — X 

dh^("") = H^ = ^ 4^"^^R^- 

Here, the gravitational acceleration has been written exactly by summing over all par- 
ticles, but this becomes prohibitive for very large numbers of particles. Since the prob- 
lem is to solve Poisson's equation, a faster approach is to use Fourier methods, since 
this allows the use of the fast Fourier transform (FFT) algorithm (see chapter 13 of 
Press et al. 1992). If the density perturbation field (not assumed small) is expressed as 
5 = X] (^fc exp(— zk • x), then Poisson's equation becomes —k^^k = 4:'jTGa^pSk, and the 
required fc-space components of V$ are just 

(V$)fc = -i$fck = p— ^ 6k k. (66) 

If we finally eliminate matter density in terms of Clm , the equation of motion for a given 
particle is 

^ (a^u) = ^ F, exp(-ik • x), = -ik ^^^^ 6^. (67) 

This can be expressed more neatly by defining dimensionless units that incorporate 
the the side of the box, L: 

X = x/i 

U = 5v/{HLa) = u/HL. ^^^^ 

For N particles, the density is p = Nm/{aL)^, so the mass of the particles and the 
gravitational constant can be eliminated and the equation of motion can be cast in an 
attractively dimensionless form: 

The fmiction /(o) is proportional to a^H{a), and has an arbitrary normalization - e.g. 
unity at the initial epoch. 

Particles are now moved according to dx = u dt, which becomes 

rfX = Urflna. (70) 
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Figure 3. Slices through AT-body simulations of different power spectra, using the same set 
of random phases for the modes in both cases. The slices are 1/15 of the box in thickness, 

and density from 0.5 to 50 is log encoded. The box-sc:;ilo power is the same in both cases, 
and produces much the same large-scale filamentary structure. However, the n = spectrum 
has much more small-scale power, and this manifests itself as far stronger clumping within the 
overall skeleton of the structure. 

It only remains to set up the initial conditions; this is easy to do if the initial epoch is 
at high enough redshift that = 1, since then U a a and the earlier discussion of 
Lagrangian perturbations shows that velocities and the initial displacements are related 

by 

AX = U. (71) 

The simplest iV-body algorithm for solving the equations of motion is the particle -mesh 
(PM) code. This averages the density field onto a grid and uses the FFT algorithm both 
to perform the transformation of density and to perform the (three) inverse transforms to 
obtain the real-space force components from their fc-space counterparts (sec Hockncy & 
Eastwood 1988; Efstathiou et al. 1985). The resolution of a PM code is clearly limited to 
about the size of the mesh. To do better, one can use a particle particlc-particle-mesh 
(P^M) code, also discussed by the above authors. Here, the direct forces are evaluated 
between particles in the nearby cells, with the grid estimate being used only for particles 
in more distant cells. An alternative approach is to use adaptive mesh codes, in which 
high-density regions are re-gridded to use a finer mesh (e.g. Kravtsov, Klypin & Khokhlov 
1997). A similar effect, although without the use of a mesh, is achieved by tree codes 
(e.g. Hernquist, Bouchet & Suto 1991). 

In practice, however, the increase in resolution gained from these methods is limited 
to a factor of < 10. This is because each particle in a cosmological A^-body simulation 
in fact stands for a large number of less massive particles. Close encounters of these 
spuriously large particles can lead to wide-angle scattering, whereas the true physical 
systems are completely collisionless. To prevent collisions, the forces must be softened, 
i.e. set to a constant below some critical separation, rather than rising as 1/r^. If there 
are already few particles per PM cell, the softening must be some significant fraction of 
the cell size, so there is a limit to the gain over pure PM. 

Despite these caveats, the results of A/'-body dynamics paint an impressive picture 
of the large-scale mass distribution. Consider figure 3, which shows slices through the 
computational density field for two particular sets of initial conditions, with different 
relative amplitudes of long and short wavelengths, but with the same amplitude for the 
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modes with wavelengths equal to the side of the computational box. Although the small- 
scale 'lumpiness' is different, both display a similar large-scale network of filaments and 
voids - bearing a striking resemblance to the features seen in reality. 

The state of the art in these calculations now routinely involves 10^ to 10^ particles, 
covering box sizes from the minimum necessary so that the box-scale modes do not sat- 
urate (~ 100 /i""'^ Mpc) to effectively the entire observable universe (e.g. Evrard et al. 
2002). The resolution available in the smaller boxes is sufficient that the nonlinear evolu- 
tion of collisionless mass distributions is now effectively a solved problem, and nonlinear 
clustering statistics for model universes of practical interest can be measured to a few % 
precision (e.g. Jenkins et al. 1998). Further improvements in these sort of calculations 
are unlikely to be of practical importance, because of the need to include the evolution 
of the baryonic component, which makes up around 20% of the total matter density. 
The history of the gas is immensely complex, since it is strongly inffuenced by feedback 
of energy from the stars that form within it. The limitation of our modelling of such 
processes lies not so much in simple numerical aspects such as resolution, but in the 
simplifying assumptions used to treat processes that occur on scales very far below the 
resolution of any simulation. See e.g. Katz, Weinberg & Hernquist (1996); Pearce et al. 
(2001). 



4. Statistics of cosmological density fields 

Having discussed the main elements of the theory of cosmological structure formation, 
wc now turn to the statistical treatment of data - which is how theory and observation 
will be confronted. The density perturbation field, 6, inhabits a universe that is isotropic 
and homogeneous in its large-scale properties, suggesting that the statistical properties 
of S should also be homogeneous. This statement sounds contradictory, and yet it makes 
perfect sense if there exists an ensemble of universes. The concept of an ensemble is used 
every time we apply probability theory to an event such as tossing a coin: we imagine 
an infinite sequence of repeated trials, half of which result in heads, half in tails. The 
analogy of coin tossing in cosmology is that the density at a given point in space will 
have different values in each member of the ensemble, with some overall variance (5^) 
between members of the ensemble. Statistical homogeneity of the S field then means that 
this variance must be independent of position. The actual field found in a given member 
of the ensemble is a realization of the statistical process. 

There arc two problems with this line of argument: (i) wc have no evidence that 
the ensemble exists; (ii) in any case, we only get to observe one realization, so how is 
the variance (i5^) to be measured? The first objection applies to coin tossing, and may 
be evaded if we understand the physics that generates the statistical process we only 
need to imagine tossing the coin many times, and we do not actually need to perform the 
exercise. The best that can be done in answering the second objection is to look at widely 
separated parts of space, since the 5 fields there should be causally imconnected; this is 
therefore as good as taking measurements from two different member of the ensemble. 
In other words, if we measure the variance {S"^) by averaging over a sufficiently large 
volume, the results would be expected to approach the true ensemble variance, and the 
averaging operator (• • •) is often used without being specific about which kind of average 
is intended. Fields that satisfy this property, whereby 

volume average ensemble average (72) 

are termed ergodic. Giving a formal proof of ergodicity for a random process is not 
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always easy (Adler 1981); in cosmology it is perhaps best regarded as a common-sense 
axiom. 

4.1. Fourier analysis of density fluctuations 

It is often convenient to consider building up a general field by the superposition of many 
modes. For a flat comoving geometry, the natural tool for achieving this is via Fourier 
analysis. For other models, plane waves arc not a complete set and one should use instead 
the eigenfunctions of the wave equation in a curved space. Normally this complication 
is neglected: even in an open universe, the difference only matters on scales of order the 
present-day horizon. 

How do we make a Fourier expansion of the density field in an infinite universe? If the 
field were periodic within some box of side L, then we would just have a sum over wave 
modes: 

F(x)=^^ke-*•^ (73) 

The requirement of periodicity restricts the allowed wavenumbers to harmonic boundary 
conditions 

27r 

kx = n—, n = l,2---, (74) 

with similar expressions for ky and k^. Now, if we let the box become arbitrarily large, 
then the sum will go over to an integral that incorporates the density of states in A:-space, 
exactly as in statistical mechanics; this is how the general idea of the Fourier transform 
is derived. The Fourier relations in n dimensions are thus 



Fix) = (^^^ J Fk{k) exp(-ik • x) d^k 
Fk{k) = (^j^ j F{x) exp(*k • x) 



(75) 



One advantage of this particular Fourier convention is that the definition of convolution 
involves just a simple volume average, with no gratuitous factors of (27r)~^/^: 

f^g=l-jf{^-y)g{y)d^y. (76) 

Although one can make all the manipulations on density fields that follow using either 

the integral or sum formulations, it is usually easier to use the sum. This saves having 
to introduce (5-functions in /c-space. For example, if we have / = ^ exp(— z/cx), the 
obvious way to extract fk is via fk = (l/L) J f exp{ikx) dx: because of the harmonic 
boundary conditions, all oscillatory terms in the sum integrate to zero, leaving only fk to 
be integrated from to L. There is less chance of committing errors of factors of 2tt in this 
way than considering / = {L/2'k) J fk ex-p{—ikx) dk and then using J exp[i{k—K)x] dx = 
2-K6n{k-K). 



Correlation functions and power spectra As an immediate example of the Fourier 
machinery in action, consider the important quantity 

C(r)^(5(x)5(x + r)), (77) 

which is the autocorrelation function of the density field usually referred to simply as 
the correlation function. The angle brackets indicate an averaging over the normalization 
volume V. Now express ^ as a sum and note that 5 is real, so that we can replace one of 
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the two 5's by its complex conjugate, obtaining 

^=l^Y^ (5k5^,e'('''-''>"e-*--^ . (78) 

Alternatively, this sum can be obtained without replacing {55) by {55*), from the relation 
between modes with opposite wavevectors that holds for any real field: (5k(— k) = 5^{k). 
Now, by the periodic boundary conditions, all the cross terms with k' ^ k average to 
zero. Expressing the remaining sum as an integral, we have 

^^■"^ = (2^ / l^kl'e-^-rf^fc. (79) 

In short, the correlation function is the Fourier transform of the power spectrum. This 

relation has been obtained by volume averaging, so it applies to the specific mode am- 
plitudes and correlation function measured in any given realization of the density field. 
Taking ensemble averages of each side, the relation clearly also holds for the ensemble 
average power and correlations which are really the quantities that cosmological studies 
aim to measure. We shall hereafter often use the alternative notation 

P(fc) = (|4p) (80) 

for the ensemble-average power (although this only applies for a Fourier series with 
discrete modes). The distinction between the ensemble average and the actual power 
measured in a realization is clarified below in the section on Gaussian fields. 

In an isotropic universe, the density perturbation spectrum cannot contain a preferred 
direction, and so we must have an isotropic power spectrum: (|5k|^(k)) = |^fc|^(A;). The 
angular part of the fc-space integral can therefore be performed immediately: introduce 
spherical polars with the polar axis along k, and use the reality of ^ so that e'^^^'^ 
cos{krcos6). In three dimensions, this yields 

^(^)=(2^/^('^^?'"'''^'- 
The 2D analogue of this formula is 

^r) = J P{k) Jo(fcr) 27rfc dk. (82) 

We shall usually express the power spectrum in dimensionless form, as the variance 
per Infc {A'^{k) = d{5'^)ld\nk oc k^P[k]): 



(83) 



This gives a more easily visualizable meaning to the power spectrum than does the 
quantity VP{k), which has dimensions of volume: A^(fc) = 1 means that there are 
order-unity density fluctuations from modes in the logarithmic bin around wavenumber k. 

(fc) is therefore the natural choice for a Fourier-space counterpart to the dimensionless 
quantity ^(r). 



Power-law spectra The above shows that the power spectrum is a central quantity 
in cosmology, but how can we predict its functional form? For decades, this was thought 
to be impossible, and so a minimal set of assumptions was investigated. In the absence of 
a physical theory, we should not assume that the spectrum contains any preferred length 
scale, otherwise we should then be compelled to explain this feature. Consequently, the 
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spectrum must be a featureless power law: 

(|4|')(xfc" (84) 

The index n governs the balance between large- and small-scale power. The meaning 
of different values of n can be seen by imagining the results of filtering the density 
field by passing over it a box of some characteristic comoving size x and averaging 
the density over the box. This will filter out waves with k > 1/x, leaving a variance 
(5^) (X /q k'^Airk'^dk oc x~^'^~^^\ Hence, in terms of a mass M a x^, we have 

6,n.s oc M-("+3)/^ (85) 

Similarly, a power-law spectrum implies a power-law correlation function. If ^(r) = 
{r/ro)~'', with 7 = n -|- 3, the corresponding 3D power spectrum is 

A^k) = - {kror r(2 - 7) sin = p^kr^y (86) 

TT / 

(= 0.903(fcro)^'^ if 7 = 1.8). This expression is only valid for n < (7 < 3); for larger 
values of n, ^ must become negative at large r (because P(0) must vanish, implying 
/o°° ^^'"'^ r"^ dr = 0). A cutoff in the spectrum at large k is needed to obtain physically 
sensible results. 

What general constraints can we set on the value of n? Asymptotic homogeneity 
clearly requires n > —3. An upper limit of n < 4 comes from an argument due to 
Zeldovich. Suppose we begin with a totally uniform matter distribution and then group 
it into discrete chunks as uniformly as possible. It can be shown that conservation of 
momentum in this process means that we cannot create a power spectrum that goes 
to zero at small wavelengths more rapidly than Sk oc k^. Thus, discreteness of matter 
produces the minimal spectrum, n = 4. More plausible alternatives lie between these 
extremes. The value n = corresponds to white noise, the same power at all wavelengths. 
This is also known as the Poisson power spectrum, because it corresponds to fiuctuations 
between different cells that scale as A density field created by throwing down a 

large number of point masses at random would therefore consist of white noise. Particles 
placed at random within cells, one per cell, create an n = 2 spectrum on large scales. 

Practical spectra in cosmology, conversely, have negative eff'ective values of n over a 
large range of wavenumber. For many years, the data on the galaxy correlation function 
were consistent with a single power law: 

see Peebles (1980), Davis & Peebles (1983). This corresponds to n ^ — 1.2. By contrast 
with the above examples, large-scale structure is 'real', rather than reflecting the low-A; 
Fourier coefficients of some small-scale process. 

The Zeldovich spectrum Most important of all is the scale-invariant spectrmu, 
which corresponds to the value n = 1, i.e. oc k'^. To see how the name arises, 
consider a perturbation $ in the gravitational potential: 

S/"^^ = AnGpoS =J> = -47rGpo4/fc^ (88) 

The two powers of k pulled down by mean that, if oc fc^ for the power spectrum 
of density fluctuations, then A| is a constant. Since potential perturbations govern the 
flatness of spacetime, this says that the scale-invariant spectrum corresponds to a metric 
that is a fractal: spacetime has the same degree of 'wrinkliness' on each resolution scale. 
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The total curvature fluctuations diverge, but only logarithmically at either extreme of 
wavelength. 

Another way of looking at this spectrum is in terms of perturbation growth balancing 
the scale dependence oi 5: 5 oc a;~*^"+^)/^. We know that 5 viewed on a given comoving 
scale will increase with the size of the horizon: 5 oc r|. At an arbitrary time, though, the 
only natural length provided by the universe (in the absence of non-gravitational effects) 
is the horizon itself: 

S{r.) oc r^rn = r-^^'^/'. (89) 

Thus, if n = 1, the growth of both and 6 with time cancels out so that the uni- 
verse always looks the same when viewed on the scale of the horizon; such a universe is 
self-similar in the sense of always appearing the same under the magnification of cosmo- 
logical expansion. This spectrum is often known as the Zeldovich spectrum (sometimes 
hyphenated with Harrison and Peebles, who invented it independently). 

The generic nature of the scale-invariant spectrum makes it difficult to use as a test, 
since many theories may be expected to have a chance of yielding something like a fractal 
spacetime. The interesting aspect to focus on is therefore where theory predicts devia- 
tions from this rule. Inflation is an interesting case, since the horizon-scale amplitude is 
expected to change logarithmically with scale in simple models (Hawking 1982): 

JhOC [-ln(fcrL"")]", (90) 

where a is a constant of order unity that depends on the inflationary potential (a = 2 
for V{^) = for example). Since the proper horizon scale at the end of inflation 

cannot be infinitely small {a™^r™^ > ^pianck), we see that Sh should vary by a small but 
definite amount over the range of scales that can be probed by the CMB and large-scale 
structure (a change by a factor 1.07 between k = 0.1 /iMpc~^ and 10~^ /iMpc~^, taking 
a = 1, rjffl = fpia„ck/a'"" and a'"« ~ 2.73/rpia„ck, so that rjffl = 10-3 °8m). This 
is pretty close to scale-invariance, but shows that small amounts of tilt are potentially 
observable if sufficiently accurate observations can be made. 

4.2. CDM models for structure formation 

The elements discussed so far assemble into the ACDM cosmological model, which is the 
simplest possibility that is consistent with current evidence. The overall matter power 
spectrum is written dimensionlessly as the logarithmic contribution to the fractional 
density variance, a^: 

A'{k) = ^o,k'\6,\'ock'+", (91) 
din K 

which undergoes linear growth 

Tk, (92) 

where the linear growth law is D{a) = ag['[l{a)] in the matter era, and the growth 
suppression factor for a density parameter 1 is 

(93) 

The transfer function Tk depends on the dark- matter content as discussed earlier, in par- 
ticular displaying a horizon-scale break at fc ~ 0.06{nmh)~^ hMpc~^. Weak oscillatory 
features are also expected if the universe has a significant baryon content. Eisenstein & 
Hu (1998) give an accurate fitting formula that describes these wiggles. This detailed fit 
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of the CDM spectrum is to be preferred to the older notation in which the spectrum was 
described by the zero-baryon form, but with an effective value of Qmh that allowed for 
the main effects of the baryon content: 

(f2„/i)eff = r = n^h exp [-f^6(l + ^/2h/n)] (94) 

(Sugiyama 1995). 

5. Comparison with 2dFGRS data 

5.1. Survey overview 

The largest dataset for which a thorough comparison with the above picture has been 
made is the 2dF Galaxy Redshift Survey (2dFGRS). This survey was designed around 
the 2dF multi-fibre spectrograph on the Anglo- Australian Telescope, which is capable 
of observing up to 400 objects simultaneously over a 2 degree diameter field of view. 
For details of the instrument and its performance see http://www.aao.gov.au/2df/, 
and also Lewis ct al. (2002). The source catalogue for the survey is a revised and 
extended version of the APM galaxy catalogue (Maddox et al. 1990a,b,c); this includes 
over 5 million galaxies down to bj = 20.5 in both north and south Galactic hemispheres 
over a region of almost lO'^dcg^. The 6,, magnitude system is related to the Johnson- 
Cousins system by bj = B — 0.304(i? — V), where the colour term is estimated from 
comparison with the SDSS Early Data Release (Stoughton et al. 2002). 




Figure 4. The distribution of galaxies in part of the 2dFGRS: slices 4° thick, centred at 
declination —2.5° in the NGP and —27.5° in the SGP. This magnificently detailed image of 
large-scale structure provides the basis for measuring the shape of the primordial fluctuation 
spectrum and hence constraining the matter content of the universe. 



The 2dFGRS geometry consists of two contiguous declination strips, plus 100 random 
2-degree fields. One strip is in the southern Galactic hemisphere and covers approx- 
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imately 75°xl5° centred close to the SGP at {a,5)={0l'^,-3Q°); the other strip is in 
the northern Galactic hemisphere and covers 75° x 7.5° centred at (a, i5)=(12.5^,+0°). 
The 100 random fields are spread uniformly over the 7000 deg^ region of the APM cat- 
alogue in the southern Galactic hemisphere. The sample is limited to be brighter than 
an extinction-corrected magnitude of 6j = 19.45 (using the extinction maps of Schlegel 
et al. 1998). This limit gives a good match between the density on the sky of galaxies 
and 2dF fibres. 

After an extensive period of commissioning of the 2dF instrument, 2dFGRS observing 
began in earnest in May 1997, and terminated in April 2002. In total, observations were 
made of 899 fields, yielding redshifts and identifications for 232,529 galaxies, 13976 stars 
and 172 QSOs, at an overall completeness of 93%. The galaxy redshifts are assigned a 
quality flag from 1 to 5, where the probability of error is highest at low Q. Most analyses 
are restricted to Q > 3 galaxies, of which there are currently 221,496. An interim data 
release took place in July 2001, consisting of approximately 100,000 galaxies (sec CoUess 
et al. 2001 for details). A public release of the full photometric and spectroscopic 
database is scheduled for July 2003. The completed 2dFGRS yields a striking view of 
the galaxy distribution over large cosmological volumes. This is illustrated in figure 5, 
which shows the projection of a subset of the galaxies in the northern and southern strips 
onto (a, z) slices. This picture is the culmination of decades of effort in the investigation 
of large-scale structure, and we are fortunate to have this detailed view for the first time. 

5.2. The 2dFGRS power spectrum and CDM models 

Perhaps the key aim of the 2dFGRS was to perform an accurate measurement of the 3D 
clustering power spectrum, in order to improve on the APM result, which was deduced 
by deprojection of angular clustering (Baugh & Efstathiou 1993, 1994). The results of 
this direct estimation of the 3D power spectrum are shown in figure 5 (Percival et al. 
2001). This power-spectrum estimate uses the FFT-based approach of Feldman, Kaiser 
& Peacock (1994), and needs to be interpreted with care. Firstly, it is a raw redshift- space 
estimate, so that the power beyond k ~ 0.2 hMpc~^ is severely damped by smearing due 
to peculiar velocities, as well as being affected by nonlinear evolution. Finally, the FKP 
estimator yields the true power convolved with the window function. This modifies the 
power significantly on large scales (roughly a 20% correction). An approximate correction 
for this has been made in figure 5. 

The fundamental assumption is that, on large scales, linear biasing applies, so that 
the nonlinear galaxy power spectrum in redshift space has a shape identical to that of 
linear theory in real space. This assumption is valid for k < 0.15/iMpc~^; the detailed 
justification comes from analyzing realistic mock data derived from A/'-body simulations 
(Cole et al. 1998). The free parameters in fitting CDM models are thus the primordial 
spectral index, n, the Hubble parameter, h, the total matter density, flm, and the baryon 
fraction, flb/^m- Note that the vacuum energy does not enter. Initially, we show results 
assuming n = 1; this assumption is relaxed later. 

An accurate model comparison requires the full covariance matrix of the data, because 
the convolving effect of the window function causes the power at adjacent k values to 
be correlated. This covariance matrix was estimated by applying the survey window to 
a library of Gaussian realisations of linear density fields, and checked against a set of 
mock catalogues. It is now possible to explore the space of CDM models, and likelihood 
contours in Clf,/Cl„i versus ilmh are shown in figure 6. At each point in this surface we 
have marginalized by integrating the likelihood surface over the two free parameters, h 
and the power spectrum amplitude. We have added a Gaussian prior h = 0.7 ± 10%, 
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k / h Mpc 

Figure 5. The 2dFGRS redshift-space dimensionless power spectrum, A^(fc), estimated ac- 
cording to the FKP procedure. The sohd points with error bars show the power estimate. The 
window function correlates the results at different k values, and also distorts the large-scale 
shape of the power spectrum An approximate correction for the latter effect has been applied. 
The solid and dashed lines show various CDM models, all assuming n = 1. For the case with 
non-negligible baryon content, a big-bang nucleosynthesis value of Q^h^ ~ 0.02 is assumed, 
together with h = 0.7. A good fit is clearly obtained for fim/i — 0.2. Note that the observed 
power at large k will be boosted by nonlinear effects, but damped by small-scale random peculiar 
velocities. It appears that these two effects very nearly cancel, but model fitting is generally 
performed only at fc < 0.15 hMpc~^ in order to avoid these complications. 

representing external constraints such as the HST key project (Freedman et al. 2001); 
this has only a minor effect on the results. 

Figure 6 shows that there is a degeneracy between f2,„/i and the baryonic fraction 
However, there are two local maxima in the likelihood, one with Q^h ~ 0.2 
and 20% baryons, plus a secondary solution ilmh ~ 0.6 and ^ 40% baryons. The 
high-density model can be rejected through a variety of arguments, and the preferred 
solution is 

rJ™/i = 0.20 ± 0.03; r2bM„ = 0.15 ±0.07. (95) 

The 2dFGRS data are compared to the best-fit linear power spectra convolved with the 
window function in figure 6. The low-density model fits the overall shape of the spectrum 
with relatively small 'wiggles', while the solution at ilmh — 0.6 provides a better fit to the 
bump at fc ~ 0.065 /i Mpc~^, but fits the overall shape less well. A preliminary analysis 
of P{k) from the full final dataset shows that P{k) becomes smoother: the high-baryon 
solution becomes disfavoured, and the uncertainties narrow slightly around the lower- 
density solution: ^Imh = 0.18 ± 0.02; ilb/rim = 0.17 ± 0.06. The lack of large-amplitude 
oscillatory features in the power spectrum is one general reason for believing that the 
universe is dominated by coUisionless nonbaryonic matter. In detail, the constraints on 
the collisional nature of dark matter are weak, since all we require is that the effective 
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Figure 6. Likelihood contours for the best-fit hnear CDM fit to the 2dFGRS power spec- 
trum over the region 0.02 < k < 0.15. Contours are plotted at the usual positions for 
one-parameter confidence of 68%, and two-parameter confidence of 68%, 95% and 99% (i.e. 
—2 ln(£/£niax) = 1, 2.3, 6.0, 9.2). We have marginalized over the missing free parameters {h and 
the power spectrum amplitude). A prior on h of h = 0.7 d= 10% was assumed. This result is 
compared to estimates from X-ray cluster analysis (Evrard 1997) and big-bang nucleosynthesis 
(Buries ot al. 2001). The second panel shows the 2dFGRS data compared with the two preferred 
models from the Maximum Likelihood fits convolved with the window function (solid lines) . The 
unconvolved models are also shown (dashed lines). The i^mh ~ 0.6, ^Ib/^m = 0.42, h = 0.7 
model has the higher bump at fc ~ 0.05 /iMpc"'^. The smoother Qmh ~ 0.20, Qb/^m = 0.15, 
ft = 0.7 model is a better fit to the data because of the overall shape. A preliminary analysis of 
the complete final 2dFGRS sample yields a slightly smoother spectrum than the results shown 
here (from Percival et al. 2001), so that the high-baryon solution becomes disfavoured. 



sound speed for modes of 100- Mpc wavelength is less than about 0.1c. Nevertheless, if 
a purc-baryon model is ruled out, the next simplest alternative would arguably be to 
introduce a weakly-interacting relic particle, so there is at least circumstantial evidence 
in this direction from the power spectrum. 

It is interesting to compare these conclusions with other constraints. These arc shown 
on figure 6, again assuming h = 0.7±10%. Estimates of the Deuterium to Hydrogen ratio 
in QSO spectra combined big-bang nucleosynthesis theory predict fife/i^ = 0.020 ± 0.001 
(Buries et al. 2001), which translates to the shown locus of vs ^Imh. X-ray cluster 
analysis yields a baryon fraction Qb/^rn = 0.127 ± 0.017 (Evrard 1997) which is within 
IfT of our value. These loci intersect very close to our preferred model. 

Perhaps the main point to emphasise here is that the 2dFGRS results are not greatly 
sensitive to the assumed tilt of the primordial spectrum. As discussed below, CMB data 
show that n = 1 is a very good approximation; in any case, very substantial tilts (n ~ 0.8) 
are required to alter the conclusions significantly. 

5.3. Robustness of results 

The main residual worry about accepting the above conclusions is probably whether the 
assumption of linear bias can really be valid. In general, concentration towards higher- 
density regions both raises the amplitude of clustering, but also steepens the correlations, 
so that bias is largest on small scales, as discussed below. We need to be clear of the 
regime in which the bias depends on scale. 

One way in which this issue can be studied is to consider subsamples with very different 
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Figure 7. The power spectra of red galaxies (filled circles) and blue galaxies (open circles), 
divided at photographic B — R = 0.85. The shapes are strikingly similar, and the square root 
of the ratio yields the right-hand panel: the relative bias in redshift space of red and blue 
galaxies. The error bars axe obtained by a jack-knife analysis. The relative bias is consis- 
tent with a constant value of 1.4 over the range used for fitting of the power-spectrum data 
(0.015 < A; < 0.15/iMpc-^). 

degrees of bias. Colour information has recently been added to the 2dFGRS database 
using SuperCosmos scans of the UKST red plates (Hambly ct al. 2001), and a division at 
rest-frame photographic B ~ R = 0.85 nicely separates ellipticals from spirals. Figure 7 
shows the power spectra for the 2dFGRS divided in this way. The shapes are almost 
identical (perhaps not so surprising, since the cosmic variance effects are closely correlated 
in these co-spatial samples). However, what is impressive is that the relative bias is 
almost precisely independent of scale, even though the red subset is rather strongly 
biased relative to the blue subset (relative & ~ 1.4). This provides sonic reassurance that 
the large-scale P{k) reflects the underlying properties of the dark matter, rather than 
depending on the particular class of galaxies used to measure it. 




6. Relation of galaxies and dark matter 

6.1. History and general aspects of bias 

In order to make full use of the cosinological information encoded in large-scale structure, 
it is essential to understand the relation between the number density of galaxies and the 
mass density field. It was first appreciated during the 1980s that these two fields need not 
be strictly proportional. Until this time, the general assumption was that galaxies 'trace 
the mass'. Since the mass density is a continuous field and galaxies are point events, the 
approach is to postulate a Poisson clustering hypothesis, in which the number of galaxies 
in a given volume is a Poisson sampling from a fictitious number-density field that is 
proportional to the mass. Thus within a volume V, 

{Ng{V)) (X M{V). (96) 

With allowance for this discrete sampling, the observed numbers of galaxies, Ng, would 
give an unbiased estimate of the mass in a given region. 

The first motivation for considering that galaxies might in fact be biased mass tracers 
came from attempts to reconcile the = 1 Einstein-de Sitter model with observations. 
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Although M/L ratios in rich clusters argued for dark matter, as first shown by Zwicky 
(1933), typical blue values of M/L ~ 300/i implied only Qm — 0.2 if they were taken 
to be universal. Those who argued that the value flm = 1 was more natural (a greatly 
increased camp after the advent of inflation) were therefore forced to postulate that 
the efficiency of galaxy formation was enhanced in dense environments: biased galaxy 
formation. 

We can note immediately that a consequence of this bias in density will be to affect 
the velocity statistics of galaxies relative to dark matter. Both galaxies and dark-matter 
particles follow orbits in the overall gravitational potential well of a cluster; if the galax- 
ies are to be more strongly concentrated towards the centre, they must clearly have 
smaller velocities than the dark matter. This is the phenomenon known as velocity bias 
(Carlberg, Couchman & Thomas 1990). 

An argument for bias at the opposite extreme of density arose through the discovery 
of large voids in the galaxy distribution (Kirshner et al. 1981). There was a reluctance 
to believe that such vast regions could be truly devoid of matter - although this was at 
a time before the discovery of large-scale velocity fields. This tendency was given further 
stimulus through the work of Davis, Efstathiou, Frenk & White (1985), who were the 
first to calculate A/'-body models of the detailed nonlinear structure arising in CDM- 
dominated universes. Since the CDM spectrum curves slowly between effective indices 
of n = —3 and n = 1, the correlation function steepens with time. There is therefore a 
unique epoch when ^ will have the observed slope of —1.8. Davis et al. identified this 
epoch as the present and then noted that, for Cl„i = 1, it implied a rather low amplitude 
of fluctuations: ro = 1.3h~^ Mpc. An independent argument for this low amplitude 
came from the size of the peculiar velocities in CDM models: if the spectrum was given 
an amplitude corresponding to the erg ~ 1 seen in the galaxy distribution, the pairwise 
dispersion was ap ~ 1000 - 1500 km s~^, around 3 times the observed value. What seemed 
to be required was a galaxy correlation function that was an amplified version of that for 
mass. This was exactly the phenomenon analysed for Abell clusters by Kaiser (1984), 
and thus was born the idea of high-peak bias: bright galaxies form only at the sites of 
high peaks in the initial density field. This was developed in some analytical detail by 
Bardecn ct al. (1986), and was implemented in the simulations of Davis et al. (1985). 

As shown below, the high-peak model produces a linear amplification of large- wavelength 
modes. This is likely to be a general feature of other models for bias, so it is useful to 
introduce the linear bias parameter: 



This seems a reasonable assumption when Sp/p <C 1, although it leaves open the question 
of how the effective value of b would be expected to change on nonlinear scales. Galaxy 
clustering on large scales therefore allows us to determine mass fluctuations only if we 
know the value of b. When we observe large-scale galaxy clustering, we are only measuring 



Later studies of bias concentrated on general models. A fruitful assumption is that 
bias is local, so that the number density of galaxies is some nonlinear function of the 
mass density 




(97) 



b'UUr) or 62ALss(fc)- 



J^sW = /[pm(r)]. 



(98) 



Coles (1993) proved the powerful result that, whatever the function / may be, the quan- 
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Figure 8. The high-peak bias modeL If we decompose a density field into a fluctuating 
component on galaxy scales, together with a long- wavelength 'swell' (shown dashed), then those 
regions of density that lie above a threshold in density of v times the rms will be strongly 
clustered. If proto-objects are presumed to form at the sites of these high peaks (shaded, and 
indicated by arrows), then this is a population with Lagrangian bias - i.e. a non-uniform 
spatial distribution oven prior to dynamical evolution of the density field. The key question 
is the physical origin of the threshold; for massive objects such as clusters, the requirement of 
collapse by the present imposes a threshold of > 2. For galaxies, there will be no bias without 
additional mechanisms to cause star formation to favour those objects that collapse first. 



tity 

Hr) = ^Ur)/Uir) (99) 

had to show a monotonic dependence on scale, provided the mass density field had 
Gaussian statistics. An interesting concrete example of this is provided by the lognormal 
density field (Coles & Jones 1991); this is generated by exponentiation of a Gaussian 
field: 

1 + = exp(5o - ctV2), (100) 

where cr^ is the total variance in the Gaussian field. These authors argue that this 
analytical form is a reasonable approximation to the exact nonlinear evolution of the 
mass density distribution function, preventing the unphysical values ^ < — 1. This non- 
Gaussian model is built upon an underlying Gaussian field, so the joint distribution of 
the density at n points is still known. This means that the correlations are simple enough 
to calculate, the result being 

= exp(CG) - 1. (101) 

This says that ^ on large scales is unaltered by nonlinearities in this model; they only add 
extra small-scale correlations. Using the lognormal model as a hypothetical nonlinear 
density field, we can now introduce bias. A nonlinear local transformation pg oc p^^ then 
gives a correlation function 1 -|- = (1 -|- ^ln)** (Mann, Peacock & Heavens 1998). The 
linear bias parameter is b, but the correlations steepen on small scales, as expected for 
Coles' result. 

In reality, bias is unlikely to be completely causal, and this has led some workers to 
explore stochastic bias models, in which 

% = /(p™) + e, (102) 

where e is a random field that is uncorrelated with the mass density (Pen 1998; Dekel & 
Lahav 1999). This means we need to consider not only the bias parameter defined via 
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the ratio of correlation functions, but also the correlation coefficient, r, between galaxies 
and mass: 

W 'W, . (103) 



Although truly stoc;liastic effects arc possible in galaxy formation, a relation of the above 
form is expected when the galaxy and mass densities are filtered on some scale (as they 
always are, in practice). Just averaging a galaxy density that is a nonlinear function 
of the mass will lead to some scatter when comparing with the averaged mass field; a 
scatter will also arise when the relation between mass and light is non-local, however, 
and this may be the dominant effect. 

6.2. The peak-background split 

We now consider the central mechanism of biased clustering, in which a rare high density 

fluctuation, corresponding to a massive object, collapses sooner if it lies in a region of 
large-scale overdensity. This 'helping hand' from the long-wavelength modes means that 
overdense regions contain an enhanced abundance of massive objects with respect to 
the mean, so that these systems display enhanced clustering. The basic mechanism can 
be immediately understood via the diagram in figure 8; it was first clearly analysed by 
Kaiser (1984) in the context of rich clusters of galaxies. What Kaiser did not do was 
consider the degree of bias that applies to more typical objects; the generalization to 
consider objects of any mass was made by Cole & Kaiser (1989; see also Mo & White 
1996 and Sheth et al. 2001). 

The key ingredient of this analysis is the mass function of dark-matter haloes. The 
universe fragments into virialized systems such that f{M) dM is the number density 
of haloes in the mass range dM] conservation of mass requires that / M f{M)dM = 
pQ. A convenient related dimensionless quantity is therefore the multiplicity function, 
M"^ f{M)/ pq, which gives the fraction of the mass of the universe contained in haloes of 
a unit range in InM. The simplest analyses of the mass function rest on the concept 
of a density threshold: collapse to a virialized object is deemed to have occurred where 
linear-theory 6 averaged over a box containing mass M reaches some critical value dc- 
Generally, we shall assume the value = 1.686 appropriate for spherical collapse in an 
Einstein-de Sitter universe. Now imagine that this situation is perturbed, by adding 
some constant shift e to the density perturbations over some large region. The effect of 
this is to perturb the threshold: fluctuations now only need to reach 6 = 5c — e in order 
to achieve collapse. The number density is therefore modulated: 

/^/-^c. (104) 

This gives a bias in the number density of haloes in Lagrangian space: Sf/f = bi^e, where 
the Lagrangian bias is 

rfln f , ^ 

In addition to this modulation of the halo properties, the large-scale disturbance will 

move haloes closer together where e is large, giving a density contrast of 1 + e. If e <C 1, 
the overall fractional density contrast of haloes is therefore the sum of the dynamical 
and statistical effects: 5haio = e + ^lE. The overall bias in Eulerian space (6 = (5haio/e) is 
therefore 
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Of course, the field e can hardly be imposed by hand; instead, we make the peak- 
background split, in which 5 is mentally decomposed into a small-scale and a large-scale 
component - which wc identify with e. The scale above which the large-scale component 
is defined does not matter so long as it lies between the sizes of collapsed systems and 
the scales at which we wish to measure correlations. 

To apply this, we need an explicit expression for the mass function. The simplest 
alternative is the original expression of Press & Schechter (1974), which can be written 
in terms of the parameter u = 6c/<j{M): 

M^f{M)/po = exp (^—^ . (107) 

We now use d/d5c = cT{M)~^{d/du) = {u/5c){d/dv), since M is not affected by the 
threshold change, which yields 

!/2-l 

h{v) = 1 + — — . (108) 

Oc 

This says that M* haloes arc imbiascd, low-mass haloes are antibiased and high-mass 
haloes are positively biased, eventually reaching the b = vja value expected for high 
peaks. The corresponding expression can readily be deduced for more accurate fitting 
formulae for the mass function, such as that of Sheth & Tormen (1999): 

mV(M)/po = 0.21617[1 (\/2/zy2)0-3] y exp[-i/V(2\/2)]. (109) 

Wo can now imdcrstand the observation that Abcll clusters arc miich more strongly 
clustered than galaxies in general: regions of large-scale overdensity contain systemat- 
ically more high- mass haloes than expected if the haloes traced the mass. This phe- 
nomenon was dubbed natural bias by White ct al. (1987). However, applying the idea to 
galaxies is not straightforward: we have shown that enhanced clustering is only expected 
for massive fluctuations with a < 1, but galaxies at ^; = fail this criterion. The high- 
peak idea applies will at high rcdshift, where massive galaxies arc still assembling, but 
today there has been time for galaxy-scale haloes to collapse in all environments. The 
large bias that should exist at high redshifts is erased as the mass fluctuations grow: if the 
Lagrangian component to the biased density field is kept unaltered, then the present-day 
bias will tend to unity as 

(Fry 1986: Tcgmark & Peebles 1998). Strong galaxy bias at z = therefore requires 
some form of selection that locates present-day galaxies preferentially in the rarer haloes 
with M> M* (Kauff'mann, Nusser & Steinmetz 1997). 

This dilemma forced the introduction of the idea of high-peak bias: bright galaxies 
form only at the sites of high peaks in the initial density field (Bardeen et al. 1986; 
Davis et al. 1985). This idea is commonly, but incorrectly, attributed to Kaiser (1984), 
but it needs an extra ingredient, namely a non-gravitational threshold. Attempts were 
therefore made to argue that the first generation of objects could propagate disruptive 
signals, causing neighbours in low-density regions to be 'still-born'. It is then possible 
to construct models [e.g. Bower ct al. 1993) in which the large-scale modulation of the 
galaxy density is entirely non-gravitational in nature. However, it turned out to be hard 
to make such mechanisms operate: the energetics and required scale of the phenomenon 
are very large (Rees 1985; Dekel & Recs 1987). These difficulties were only removed when 
the standard model became a low-density universe, in which the dynamical argument for 
high galaxy bias no longer applied. 
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Figure 9. The correlation length for clusters of galaxies, ro, as a function of mean intercluster 
separation, dc, taken from Colberg et al. (2000). Results arc shown for rCDM (open squares) 
and ACDM (filled squares) simulations. The predictions of Sheth et al. (2001) arc shown as 
solid lines. Also shown are data from the APM cluster catalogue (open triangles), taken from 
Croft et al. (1997). 



As indicated above, the first strong indications of biased clustering came from measure- 
ments of the correlation function of Abell clusters, which showed a far greater amplitude 
than for galaxies in general (Klypin & Kopylov 1983: Bahcall & Soneira 1983). Following 
Kaiser (1984), Cole & Kaiser (1989) etc., our explanation for this is that massive haloes 
show clustering that is an increasing function of mass. This is illustrated in figure 9, 
which shows that the rarest and most rich clusters (as measured by the intercluster 
separation) have the highest clustering, and that the trend is in agreement with the 
theoretical predictions. 

Because galaxy halo masses are less extreme, it is not so clear a priori that any trend of 
this sort should be expected for galaxies. However, our empirical knowledge of luminosity 
functions and morphological segregation did argue for an effect. It has been clear for many 
years that elliptical galaxies display a higher correlation amplitude than spirals (Davis 
& Geller 1976), and this makes sense in terms of the preference of ellipticals for cluster 
environments. Since ellipticals are also more; Imriinous on average than spirals, some 
trend with luminosity is to be expected, but the challenge is to detect it. For a number 
of years, the existence of any efltect was controversial (e.g. Loveday et al. 1995; Benoist 
et al. 1996), but Norberg et al. (2001) were able to use the 2dFGRS to demonstrate 
very clearly that the effect existed, as shown in Figure 10. The results can be described 
by a linear dependence of eflfective bias parameter on luminosity: 



and the scale-length of the real-space correlation function for L* galaxies is approximately 



6.3. Observations of biased clustering 



b/b* =0.85 + 0.15 (L/L*), 



(111) 
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Figure 10. (a) The correlation length in real space as a function of absolute magnitude. The 
solid line shows the predictions of the semi-analytic model of Benson et al. (2001), computed in 
a series of overlapping bins, each 0.5 magnitudes wide. The dotted curves show an estimate of 
the errors on this prediction, including the relevant sample variance for the survey volume, (b) 
The real space correlation length estimated combining the NGP and SGP (filled circles). The 
open symbols show a selection of recent data from other studies. 

ro = 4.8/i~^Mpc. Finally, with spectral classifications, it is possible to measure the 
dependence of clustering both on luminosity and on spectral type, to see to what extent 
morphological segregation is responsible for this result. Norbcrg ct al. (2002) show that, 
in fact, the principal effect seems to be with luminosity: ^(r) increases with L for all 
spectral types. 

Finally, wc can look at high-rcdshift clustering. At high enough rcdshift, M* is of 
order a galaxy mass and galaxies could be strongly biased relative to the mass at that 
time. Indeed, there is good evidence that this is the case. Steidel et al. (1997) 
have used the Lyman-limit technique to select galaxies around redshifts 2.5 z 3.5 
and found their distribution to be highly inhomogeneous. The apparent value of erg for 
these objects is of order unity (Adelberger et al. 1998), whereas the present value of 
as — 0.8 should have evolved to about 0.26 at these redshifts (for flm = 0.3, fc = 0). 
This suggests a bias parameter of 6 ~ 4, or ~ 2.5, which requires a halo mass of 
about lO^^'^/i~^M0 for concordance ACDM. The masses of these high-redshift galaxies 
can be estimated directly through their stellar masses, which arc typically 10^'^ h^'^ Mq 
(Papovich, Dickinson & Ferguson 2001), and thus only 1% of what is required in order 
to explain the clustering. This is an unreasonably small baryon fraction, so the correct 
explanation is more plausibly that each lO^^/i^^M© halo at 2; = 3 contains a number of 
Lyman-break galaxies. This theme is pursued below. 

6.4. Scale dependence of bias 

The Poisson clustering hypothesis would propose that galaxies are simply a dilute sam- 
pling of the mass field. If this were a correct hypothesis, no CDM universe would be 
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Figure 11. The correlation function of galaxies in the somianalytical simulation of an LCDM 
universe by Benson et al. (2000a). Although the nonlinear correlations of the mass show a 
characteristic convex bulge at separations around 1 Mpc, the galaxy data follow a power law 
which thus is antibiased on these small scales. The simulation is successful at reproducing this 
trend. 

acceptable, since the correlation functions for these models differ from the observed 
galaxy correlations in a complicated scale-dependent fashion (e.g. Klypin, Primack & 
Holtzman 1996; Peacock 1997; Jenkins et al. 1998). 

For a few years, this failure of CDM models to match the shape of the galaxy power 
spectrum was seen as a serious problem, but this was eventually resolved by more detailed 
theoretical predictions for galaxy clustering. Two approaches are being followed in this 
regard. The brute-force method is to perform A''-body simulations in which the evolution 
of both collisionless dark matter and dissipative gas is followed, with the physical state 
of the gas (i.e. its ability to cool) being used as a cue to insert star formation. The 
stars in turn are allowed to feed energy back into the gas, simulating the effects of mass 
loss and supernovae. This determines the star formation history of a given halo, and 
its appearance can be predicted using spectral synthesis codes. This is challenging, but 
starting to be feasible with current computing power (Pearce et al. 2001). The alternative 
is 'semianalytic' modelling, in which the merging history of dark-matter haloes is treated 
via the extended Press-Schechter theory (Bond et al. 1991), and the location of galaxies 
within haloes is estimated using dynamical-friction arguments {e.g. Kauffmann et al. 
1993, 1999; Cole et al. 1994, 2000; Somerville & Primack 1999; van Kampen, Jimenez & 
Peacock 1999; Benson et al. 2000a,b). 

Both these approaches have yielded similar conclusions, and shown how CDM models 
can match the galaxy data: specifically, the low-density flat ACDM model that is favoured 
on other grounds can yield a correlation function that is close to a single power law over 
1000 ^ ^ ^ 1, even though the mass correlations show a marked curvature over this 
range (Pearce et al. 1999; 2001; Benson et al. 2000a; see figure 11). These results 
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are impressive, yet it is frustrating to have a result of such fundamental importance 
emerge from a complicated calculational apparatus. There is thus some motivation for 
constructing a simpler heuristic model that captures the main processes at work in the full 
semianalytic models. The following section describes an approach of this sort (Peacock 
& Smith 2000; Seljak 2000; Cooray & Sheth 2002). 

6.5. The halo model - I: mass 

The formation of galaxies must be a non-local process to some extent, and the modern 
paradigm was introduced by White & Rees (1978): galaxies form through the cooling of 
baryonic material in virialized haloes of dark matter. The virial radii of these systems 
are in excess of 0.1 Mpc, so there is the potential for large differences in the correlation 
properties of galaxies and dark matter on these scales. The 'halo model' addresses this by 
creating a density field in which dark-matter haloes are superimposed. The key feature 
that allows bias to be included is to encode all the complications of galaxy formation 
via the halo occupation number: the number of galaxies found above some luminosity 
threshold in a virialized halo of a given mass. 

To some extent, this is a very old idea: one of the earliest suggested models for the 
galaxy correlation function was to consider a density field composed of randomly-placed 
independent clumps with some universal density profile (Neyman, Scott & Shane 1953; 
Peebles 1974). Since the clumps are placed at random (with number density n), the only 
excess neighbours to a given mass point arise from points in the same clump, and the 
correlation function is straightforward to compute in principle. For the case where the 
clumps have a power-law density profile, 

p = nBr-\ (112) 

truncated dX r — R, the small-r behaviour of the correlation function is ^ oc r'^"^'^, 
provided 3/2 < e < 3. For smaller values of e, ^(r) tends to a constant as r — > 0. In the 
isothermal e = 2 case, the correlation function for r <C i? is 

^(^^ = ^ = 16^^ 
where N is the total number of particles per clump (Peebles 1974). 

The general result is that the correlation function is less steep at small r than the 
clump density profile, which is inevitable because an autocorrelation function involves 
convolving the density field with itself. A long-standing problem for this model is there- 
fore that the predicted correlation function is much flatter than is observed for galaxies: 
^ oc r~^'* is the canonical slope, apparently requiring clumps with very steep density 
profiles, e = 2.4. This is not in agreement with the profiles of dark-matter haloes as 
'observed' in numerical simulations. 

Traditionally, virialized systems have been found by a criterion based on percolation 
('friends-of-friends'), such that the mean density is about 200 times the mean. Sometimes, 
the criterion is taken as a density of 200 times the critical value. We shall use the former 
definition: 



\800TTpb) ■ ^^^^^ 



Thus r„ is related to the Lagrangian radius containing the mass via r„ = i?/200^/^. 
Of course, the density contrast used to define the boundary of an object is somewhat 
arbitrary. Fortunately, much of the mass resides at smaller radii, near a 'core radius'. 
These core radii are relatively insensitive to the exact definition of virial radius. 
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Figure 12. A comparison of various possible density profiles for virialized haloes. The dotted 
line is a singular isothermal sphere. The solid lines show haloes with formation redshifts of 
and 5 according to NFW (fi = 1) and M99. 



The simplest model for the density structure of the virialized system is the singular 
isothermal sphere: p = al/{2nGr^), or 

200 , , , 

p/Pb = {y < 1); y = r/ry. (115) 

A more realistic alternative is the profile proposed by Navarro, Prenk & White (1996; 
NFW): 

Ac 

P/Pb = y^i + y)2 ' i^<rv); y=r/rc. (116) 
The parameter Ac is related to the core radius and the virial radius via 

200cV3 , 

Ac = , ^ . y c = r^ r, 117 

ln(l + c) - 0/(1 + c) 

(we change symbol from NFW's 5c to avoid confusion with the linear-theory density 
threshold for collapse, and also because our definition of density is relative to the mean, 
rather than the critical density). NFW showed that Ac is related to collapse redshift via 

Ac ~ 3000(1 + 0c)^ (118) 

An advantage of the definition of virial radius used here is that Ac is independent of O 
(for given Zc), whereas NFW's Jc is oc f2. 

The above equations determine the concentration, c = ry/rc implicitly, hence in princi- 
ple giving rc in terms of r„ once Ac is known. NFW give a procedure for determining Zc. 
A simplified argument would suggest a typical formation era determined by D{zc) = l/i', 
where D is the linear-theory growth factor between z = Zc and the present, and v is the 
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dimensionless fluctuation amplitude corresponding to the system in units of the rms: 
u = 5c/<j{M), where 5c — 1.686. For very massive systems with v ^ I, only rare fluctua- 
tions have collapsed by the present, so Zc is close to zero. This suggests the interpolation 
formula 

D(zc) = l + l/iy; (119) 

The NFW formula is actually of this form, except that the l/v term is multiplied by a 
spectrum-dependent coefiicient of order unity. It has been claimed by Moore et al. (1999; 
M99) that the NFW density profile is in error at small r. M99 proposed the alternative 
form 

^3/2(f+^3/2) i ir<rvy, y = r/rc. (120) 

It is straightforward to use this in place of the NFW profile. 

We now compute the power spectrum for the halo model. Start by distributing point 
seeds throughout the universe with number density n, in which case the power spectrum 
of the resulting density field is just shot noise: 

(m) 

Here, we use a dimensionless notation for the power spectrum: is the contribution 
to the fractional density variance per unit interval of Infc. In the convention of Peebles 
(1980), this is 

(V being a normalization volume), and the relation to the correlation function is 

N /" a9/,n dk sinfcr /,„„x 
ar)=jA\k)-^. (123) 

The density field for a distribution of clumps is produced by convolution of the initial 
field of delta-functions, so the power spectrum is simply modified by the squared Fourier 
transform of the clump density profile: 

where 

Jp(r)^^4TTrUr 
^ J pi''") 471" r'^ dr ^ ^ 

For a practical calculation, we should also use the fact that hierarchical models are 
expected to contain a distribution of masses of clumps. If we use the notation /(M) dM 
to denote the number density of haloes in the mass range dM, the effective number 
density in the shot noise formula becomes 

1 [M^f(M)dM , , 
= ^-—^ ^. (126 

'^eff {\ M ]{M)dM\ 

The window function also depends on mass, so the overall power spectrum is 

[/Af/(M)<iM]" 
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The normalization term / M f{M)dM just gives the total background density, pb, so 
there is only a single numerical integral to perform. Using this model, it is then possible 
to calculate the; correlations of the nonlinear density field, neglecting only the large-scale 
correlations in halo positions. The power spectrum determined in this way is shown in 
figure 13, and turns out to agree very well with the exact nonlinear result on small and 
intermediate scales. The lesson here is that a good deal of the nonlinear correlations of 
the dark matter field can be understood as a distribution of random clumps, provided 
these are given the correct distribution of masses and mass-dependent density profiles. 




Figure 13. The decomposition of the mass power spectrum according to the halo model, for 
the flat ilm = 0.3, F = 0.2, as = 0.8 case. The dashed line shows linear theory, and the open 
circles show the predicted 1-halo contribution. Adding in linear theory to produce the correct 
large-scale clustering yields the solid points. The full lines show the contribution of different 
mass ranges to the 1-halo term: bins of width a factor 10 in width, starting at 10^° — 10^^/i~^M© 
and ending at 10^^ - W^^h'^MQ. The more massive haloes have larger virial radii and hence 
filter the power spectrum on progressively larger scales. The majority of the quasilincar power 
is contributed by the haloes near the peak in the mass function at lO" - lO^^/i^^M©. 

So far, we have ignored any spatial correlations in the halo positions. A simple guess 
for amending this is to add the linear power spectrum to the power generated by the 
halo structure: 

AL = ALio + ALear- (128) 

The justification for this is that the extra small-scale power introduced by nonlinear 
evolution is associated with the internal structure of the haloes. In practice, this model 
works extremely well, giving an almost perfect description of the power spectrum on all 
scales. This is a novel way of looking at the features in the nonlinear spectrum, par- 
ticularly the steep rise between k ~ 0.5/iMpc^^ and k ^ 5 hMpc~^ , and the flattening 
on smaller scales. According to the ideas presented here, the flat small-scale spectrum 
arises because haloes have central density profiles rising as r~^'^, but not much faster. 
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The sharp fall in power at smaller k reflects the cutoff at the virial radii of the haloes 
that dominate the correlation signal. 

It might be objected that this mocicl is still not completely realistic, since we have 
treated haloes as smooth objects and ignored any substructure. At one time, it was 
generally believed that collisionless evolution would lead to the destruction of galaxy- 
scale haloes when they are absorbed into the creation of a larger-scale nonlinear system 
such as a group or cluster. However, it turns out that this 'overmerging problem' was 
only an artefact of inadequate resolution (see e.g. van Kampen 2000). When a simulation 
is carried out with ~ 10^ particles in a rich cluster, the cores of galaxy-scale haloes can 
still be identified after many crossing times (Ghigna et al. 1998). This substructure must 
have some effect on the correlations of the density field, and indeed Valageas (1999) has 
argued that the high-order correlations of the density field seen in N-hody simulations 
are inconsistent with a model where the density file is composed of smooth virialized 
haloes. Nevertheless, substructure seems to be unimportant at the level of two-point 
correlations. 

The existence of substructure is important for the obvious next step of this work, which 
is to try to understand galaxy correlations within the current framework. It is clear that 
the galaxy-scale substructure in large dark-matter haloes defines directly where luminous 
galaxies will be found, giving hope that the main features of galaxy formation can be 
understood principally in terms of the dark-matter distribution. Indeed, if catalogues 
of these 'sub-haloes' are created within a cosmological-sizcd simulation, their correlation 
function is known to differ from that of the mass (e.g. Klypin et al. 1999; Ma 1999). The 
model of a density field consisting of smooth haloes may therefore be a useful description 
of the galaxy field, and this is explored in the following section. 

6.6. The Halo model - II: biased galaxy populations 

In relating the distribution of galaxies to that of the mass, there are two distinct ways 

in which a degree of bias is inevitable: 

(1) Halo occupation numbers. For low-mass haloes, the probability of obtaining an 
L* galaxy must fall to zero. For haloes with mass above this lower limit, the 
number of galaxies will in general not scale linearly with halo mass. 

(2) Nonlocality. Galaxies can orbit within their host haloes, so the probability of 
forming a galaxy depends on the overall halo properties, not just the density at 
a point. Also, the galaxies can occupy special places within the haloes: for a 
halo containing only one galaxy, the galaxy will clearly mark the halo centre. In 
general, we will assume one central galaxy and a number of satellites. 

The first mechanism leads to large-scale bias, because large-scale halo correlations depend 
on mass, and are some biased multiple of the mass power spectrum: = 6^(M)A^. As 
discussed earlier, the linear bias parameter for a given class of haloes, b{M), depends on 
the rareness of the fluctuation and the rms of the underlying field: 

6=1 + ^ = 1 + ^ (129) 
fcr dc 

(Kaiser 1984; Cole & Kaiser 1989; Mo & White 1996), where = 6c/ a, and cr^ is the 
fractional mass variance at the redshift of interest. This formula is not perfectly accurate, 
but the deviations may be traced to the fact that the Press-Schechter formula for the 
number density of haloes (which is assumed in deriving the bias) is itself systematically 
in error; see Sheth & Tormen (1999). 

If we do not wish to assume that the number of galaxies in a halo of mass M is strictly 
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^tot = 1 + '%oo\v (131) 



proportional to M, we are in effect giving haloes a mass-dependent weight, as was first 
considered by Jing, Mo & Borner (1998). A simple but instructive model for this is 

"'(^^ = I (M/Me)«-i (M > mJ (^=^0) 

A model in which mass traces light would have — > and a = 1. We will show below 
that, empirically, we should choose a < 1. 

The bias formula applies to haloes of a given v, i.e. of a given mass, so the effect of 
mass-dependent weights is 

Where F(> i/) is the fraction of the mass in haloes exceeding a given v; dF/dv oc 
exp(— 1^^/2) according to Press-Schechter theory. The total model for the galaxy power 
spectrum is then 

a^ = (aL,o) + 6LaL (132) 

where 

\ haio^ )/ y^nj [jMwiM)fiM)dM]^ ^ ' 

The key ingredient needed to make this machinery work is the occupation number, 
which in principle needs to be calculated via a detailed numerical model of galaxy for- 
mation. However, for a given assumed background cosmology, the answer may be deter- 
mined empirically. Galaxy redshift surveys have been analyzed via grouping algorithms 
similar to the 'friends-of-friends' method widely employed to find virialized clumps in 
iV-body simulations. With an appropriate correction for the survey limiting magnitude, 
the observed number of galaxies in a group can be converted to an estimate of the total 
stellar luminosity in a group. This allows a determination of the All Galaxy System 
(AGS) luminosity function: the distribution of virialized clumps of galaxies as a fimction 
of their total luminosity, from small systems like the Local Group to rich Abell clusters. 

The AGS function for the CfA survey was investigated by Moore, Prenk & White 
(1993), who found that the result in blue light was well described by 

# = (j)* [{L/L*f + {L/L*y] dL/L*, (134) 

where 0.00126ft,^Mpc"3, /? = 1.34, - = 2.89; the characteristic luminosity is L* = 

7.6 X lO^°/i~^iy0. One notable feature of this function is that it is rather flat at low 
luminosities, in contrast to the mass function of dark-matter haloes (see Sheth & Tormen 
1999). It is therefore clear that any fictitious galaxy catalogue generated by randomly 
sampling the mass is unlikely to be a good match to observation. The simplest cure for 
this deficiency is to assume that the stellar luminosity per virialized halo is a monotonic, 
but nonlinear, function of halo mass. The required luminosity-mass relation is then 
easily deduced by finding the luminosity at which the integrated AGS density $(>!/) 
matches the integrated number density of haloes with mass > M. The result is shown 
in figure 14. 

We can now calculate the halo-based galaxy power spectrum and use semi-realistic 
occupation numbers, N, as a function of mass. This is needs a little care at small 
numbers, however, since the number of haloes with occupation number unity affects the 
correlation properties. These haloes contribute no correlated pairs, so they simply dilute 
the signal from the haloes with N > 2. This means that we need in principle to use 
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halo mass / h Mg 

Figure 14. The empirical luminosity-mass relation required to reconcile the observed AGS 
luminosity function with two variants of CDM. L* is the characteristic luminosity in the AGS 
luminosity function (L* = 7.6 x 10^" Lq). Note the rather flat slope around M = 10^^ to 
lO"/i-^M0, especially for ACDM. 



different weights for the large-scale bias and the halo term: 

m = ^ = (135) 

respectively (Scljak 2000). In practice, this correction has a rather small effect, provided 
the relation between N and M has no scatter. If, in contrast, the distribution of N for 
given M is assumed to obey a Poisson distribution, the small-scale clustering properties 
are strongly affected, and do not match the data well (Benson et al. 2000a). Finally, we 
need to put the galaxies in the correct location, as discussed above. If one galaxy always 
occupies the halo centre, with others acting as satellites, the small-scale correlations 
automatically follow the slope of the halo density profile, which keeps them steep. The 
results of this exercise arc shown in figure 15. This shows that, depending on the range 
of halo masses chosen, the galaxies can be positively or negatively biased with respect to 
the mass, as expected. What is particularly interesting is that the shape of the galaxy 
spectrum is expected to differ from that of the mass. For an appropriate mass range, the 
galaxy power spectrum can be very close to a power law, which has been a long-standing 
puzzle to explain. Interestingly, the power-law should not be perfect; small deviations 
have long been suspected, and were confirmed by Hawkins et al. (2002) and Zehavi et al. 
(2003). The inflection is at a scale of ~ 0.5/iMpc~^, as expected from the halo model. 
Figure 15 also shows that the results of this simple model are encouragingly similar to the 
scale-dependent bias found in the detailed calculations of Benson et al. (2000a), shown 
in figure 11. There are thus grounds for optimism that we may be starting to attain a 
physical understanding of the origin of galaxy bias. 
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Figure 15. The power speetmm predieted for the halo model, for the flat fim = 0.3, F = 0.2, 
as ~ 0.8 case. The halo occupation numbers are predicted according to a simple two-parameter 
model, in which haloes are only included above some minimum value, Mmin, and receive a weight 
oc M". A model with 'mass traces light' would have Mmin ~ and a — 1. Motivated by the 
results on abundances of groups, we choose a = 0.5. Results are shown for Mmin in the range 
10^" (lowest) to 10^^'^h~^MQ (highest). The mass power spectrum is shown as a dashed line 
(linear theory) and as points (nonlinear). The second panel shows the scale-dependent relative 
bieis, i.e. the square root of the ratio between galaxy and mass power spectra. The dashed lines 
show the bias with respect to linear theory. The final panel shows the ratio to the canonical 
power-law spectrum; for Mmin in the region of lO^^/i^^M©, the result is within a factor 2 of a 
perfect power law over a factor 300 in scale. 

7. Anisotropies in the CMB 

Despite the great progress in precise measurements of large-scale structure, we cannot 
achieve a complete specification of the cosmological model in this way alone. The vacuum 
energy is not probed, since this affects mainly the growth rate of structure - which is 
degenerate with bias evolution. The matter content is only constrained if we assume that 
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n = 1, and even then we only measure Q.m if a value for h is supplied. A more complete 
picture is obtained if we include data on clustering at much earlier times: the anisotropy 
of the microwave background, which reaches us from z ~ 1100. In addition to breaking 
degeneracies, studies of this sort also test the basic gravitational instability theory - 
which will be seen to work very well indeed over this redshift range. This section briefly 
reviews the physics of CMB anisotropics, and presents recent data. For more details, see 
e.g. Hu & Dodelson (2002), or Dodelson (2003). 

7.1. Anisotropy mechanisms 

Fluctuations in the 2D temperature perturbation field arc treated similarly to density 
fluctuations, except that the fleld is expanded in spherical harmonics, so modes of differ- 
ent scales are labelled by multipole number, f . Once again, we can define a 'power per 
octave' measure for the temperature fluctuations: 

T\l) = £{£ + l)Q/27r; {{6T/Tf) = ^^(2^ + 1) Q/47r, (136) 

e 

where the Ci are another common way of representing the power. Note that is a 

power per In^; the modern trend is often to plot CMB fluctuations with a linear scale 
for £ - in which case one should really use T'^ {£)/£. 

We now list the mechanisms that cause primary anisotropics in the CMB (as opposed 
to secondary anisotropics, which are generated by scattering along the line of sight). 
There arc three basic primary effects, illustrated in figure 16, which are important on 
respectively large, intermediate and small angular scales: 

(1) Gravitational (Sachs-Wolfe) perturbations. Photons from high-density regions at last 
scattering have to climb out of potential wells, and are thus redshifted: 

ST $ 

(2) Intrinsic (adiabatic) perturbations. In high-density regions, the coupling of matter 
and radiation can compress the radiation also, giving a higher temperature: 

f = ^. (138) 

(3) Velocity (Dopplcr) perturbations. The plasma has a non-zero velocity at recombina- 
tion, which leads to Doppler shifts in frequency and hence brightness temperature: 

To the above list should be added 'tensor modes': anisotropics due to a background 
of primordial gravitational waves, potentially generated during an inflationary era (see 
below). 

There arc in addition effects generated along the line of sight. One important effect is 
the integrated Sachs- Wolfe effect, which arises when the potential perturbations evolve: 



5T _ 



dt. (140) 



This happens both at early times (because radiation is still important) and late times 
(because of A). Other effects arc to do with the development of nonlinear structure, and 
are mainly on small scales (principally the Sunyaev-Zeldovich effect from IGM Comp- 
tonization). The exception is the effect of reionization; to a good approximation, this 
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density perturbation ^ 

Figure 16. Illustrating the physical mechanisms that cause CMB anisotropics. The shaded arc 
on the right represents the last-scattering shell; an inhomogeneity on this shell affects the CMB 
through its potential, adiabatic and Doppler perturbations. Further perturbations are added 
along the line of sight by time- varying potentials (Rees-Sciama effect) and by electron scattering 
from hot gas (Sunyaev-Zeldovich effect). The density field at last scattering can be Fourier 
analysed into modes of wavevector k. These spatial perturbation modes have a contribution 
that is in general damped by averaging over the shell of last scattering. Short- wavelength modes 
are more heavily affected (i) because more of them fit inside the scattering shell, and (ii) because 
their wavevectors point more nearly radially for a given projected wavelength. 



merely damps the fluctuations on all scales: 

jr ^exp-T, (141) 

where the optical depth must exceed r ~ 0.04, based on the highest-redshift quasars and 
the BBN baryon density. As we will see later, CMB polarization data have detected a 
signature consistent with r = 0.17 ± 0.04, implying reionization at z ~ 20. 

7.2. Inflationary predictions 

The most commonly-discussed mechanism for generating the inhomogeneities that act as 
the source for 5T/T is inflation. Of course, CMB anisotropics were calculated in largely 
the modern way well before inflation was ever considered, by Peebles & Yu (1970). The 
standard approach involves super-horizon fluctuations, which must be generated by some 
acausal process. Inflation achieves this ~ but we cannot claim that detection of super- 
horizon modes amounts to a proof of inflation. Rather, we need some more characteristic 
signature of the speciflc process used by inflation: amplified quantum fluctuations (see 
e.g. chapter 11 of Peacock 1999 or Liddle & Lyth 2000 for details). 

In the simplest models, inflation is driven by a scalar flcld 0, with a potential V{<j)). As 
well as the characteristic energy density of inflation, V , this can be characterized by two 
parameters, e r], which are dimensionless versions of the flrst and second derivatives 
of V with respect to (j). In these terms, the inflationary predictions for the perturbation 
index is 

n = 1 - 6e -f 2?7. (142) 

Since inflation continues while e & 77 are small, some tilt is expected {\n — 1| ^ 0.01 to 
0.05 in simple models). 

The critical ingredient for testing inflation by making further predictions is the possi- 
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Figure 17. Angular power spectra T^(£) = £{£+l)Ce/2Tv for the CMB, plotted against angular 
wavenumber £ in radians"^. For references to the experimental data, see Spergel et al. (2003), 
Kuo et al. (2002) and Pearson et al. (2002). The two lines show model predictions for adiabatic 
scale-invariant CDM fluctuations, calculated using the cmbfast package (Seljak & Zaldarriaga 
1996). These have (n, fi^, ili,,h) = (1, 0.3, 0.05, 0.65) and have respectively r2„ = 1 — Qrn ('flat') 
and Qv = ('open'). The main effect is that open models shift the peaks to the right, as 
discussed in the text. 

bility that, in addition to scalar modes, the CMB could also be affected by gravitational 
waves (following the original insight of Starobinsky 1985). The relative amplitude of 
tensor and scalar contributions depended on the inflationary parameter e alone: 

^~12.4e~6(l-ns). (143) 

The second relation to the tilt is less general, as it assumes a polynomial-like potential, so 
that 77 is related to e. For example, V = X(j)^ implies rig — 0.95 and CJ /Cf ~ 0.3. To be 
safe, we need one further observation, and this is potentially provided by the spectrum 
of CJ . Suppose we write separate power-law index definitions for the scalar and tensor 
anisotropies: 

For the scalar spectrum, we had — n ~ 1 — 6e + 2r/; for the tensors, n^^ — I — 2e 
[although different definitions of jt-t exist; the convention here is that n = 1 always 
corresponds to a constant T'^{£)]. Thus, a knowledge of ris, Ut and the scalar-to-tensor 
ratio would overdetermine the model and allow a genuine test of inflation. 
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7.3. Characteristic scales 

The current data are contrasted with some CDM models in figure 17. The key feature 
that is picked out is the peak at ^ ~ 220, together with harmonics of this scale at higher 
i. Beyond t ~ 1000, the spectrum is clearly damped, in a manner consistent with the 
expected effects of photons diffusing away from baryons (Silk damping), plus smearing of 
modes with wavelength comparable to the thickness of the last-scattering shell. This last 
effect arises because recombination is not instantaneous, so the redshift of last scattering 
shows a scatter around the mean, with a thickness corresponding to approximately (7^ = 
l{rLrah?)~^^'^ Mpc. On scales larger than this, we see essentially an instantaneous imprint 
of the pattern of potential perturbations and the acoustic baryon/photon oscillations. 

The significance of the main acoustic peak scale is that it picks out the (sound) horizon 
at last scattering. The redshift of last scattering is almost independent of cosmological 
parameters at ^ls — 1100, although a more precise approximation is given in Appendix 
C of Hu & Sugiyama (1995). If we assume that the universe is matter dominated at last 
scattering, the horizon size is 

D"^ = 184 (0„/i2)-i/2Mpc. (145) 

The angle this subtends is given by dividing by the current size of the horizon (strictly, 
the comoving angular-diameter distance to ^ls)- Again, for a matter-dominated model, 
this is 

Dh = 6000^-1/1-1 Mpc ^ 9^ = D^yD^ocfl^J. (146) 

This expression lies behind the common statement that the CMB data require a flat 
universe. Figure 17 shows that heavily open universes yield a main CMB peak at scales 
much smaller than the observed £ ~ 220, and these can be ruled out. Indeed, open models 
were disfavoured for this reason long before any useful data existed near the peak, simply 
because of strict upper limits at ^ ~ 1500 (Bond & Efstathiou 1984). However, once a 
non-zero vacuum energy is allowed, the story becomes more complicated, and it turns 
out that large degrees of spatial curvature cannot be excluded using the CMB alone. 



7.4. Evolution of CMB data 

The pace of progress in CMB experiments has maintained an astonishing rate for a 

decade. Following the 1992 COBE detection of fluctuations, 5 years of effort yielded the 
unclear picture of the flrst panel in figure 18, in which of order 10 experiments gave only 
vague evidence for a peak in £^Ce. By the year 2000, this had been transformed to a 
clear picture of a peak at ^ ~ 200, although there was no model-independent evidence 
for higher harmonics. The present situation is much more satisfactory, with 3 peaks 
established in a way that does not require any knowledge of the CDM model. 

The WMAP results (Spcrgel et al. 2003) measure the power spectrum about as 
well as possible (i.e. hitting the limit of cosmic variance from a finite sky) up to 
the second peak. At smaller scales, however, there is still much scope for improve- 
ment, and the rate of advance is unlikely to drop in the future. The useful web page 
http://background.uchicago.edu/~whu/cmbex.html lists 14 ongoing experiments, as 
well as 19 completed ones. 



48 



J. A. Peacock: Surveys and cosmic structure 



i OVRO 




2003 - 




10 100 1000 



multipole number 2 

Figure 18. Dramatic change took place in CMB power spectrum measurements around the 
turn of the 21st century. Although some rise from the COBE level was arguably known even by 
1997, a clear peak around £ ~ 200 only became established in 2000, whereas by 2003 definitive 
measurements of the spectrum at £ < 800, limited mainly by cosmic variance, had been made 
by the WMAP satellite. 
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8. Model degeneracies inherent in CMB data 

8.1. Geometrical degeneracy 

The normal argument for flatness from the CMB starts with the comoving horizon size 
at last scattering 

D,s = (1 + ^.s)-'/' ^ 184(f]™/i2)-i/2 Mpc (147) 

and divides it by the present-day horizon size for a zero-A universe, 

1 /2 

to yield a main characteristic angle that scales as ftm ■ Large curvature (i.e. low Qm) is 
ruled out because the main peak in the CMB power spectrum is not seen at very small 
angles. However, introducing vacuum energy changes the conclusion. If we take a family 
of models with fixed initial perturbation spectra, fixed physical densities LUm = flmh^, 
iVb = f^6^^) it is possible to vary both fly and the curvature to keep a fixed value of 
the angular size distance to last scattering, so that the resulting CMB power spectra are 
identical. This degeneracy was analyzed comprehensively by Efstathiou & Bond (1999), 
and we now summarize the main results. 

The usual expression for the comoving angular-diameter distance is 
c 



ihSkir) ~\i-n\-y^ 



Sk 



f 

Jo 



\i-n\'/^dz' (149) 



V(l - fi)(l + Z')^ + ^v+ J^m(l + Z')^ 



where O = flm + Defining = flih^, this can be rewritten in a way that has no 
explicit h dependence: 



3000 Mpc ^ 
RoSk{r) = — — TTii^Sk 



1/2 dz' 



Vwfe(l + z'f a;„(l z'f 



(150) 



whore cjfe = (1 — fl^ — fl^^lr? . This parameter describes the curvature of the universe, 
treating it effectively as a physical density that scales as p oc a"^. This is convenient 
for the present formalism, but it is important to appreciate that curvature differs funda- 
mentally from a hypothetical fluid with such an equation of state. 

The sound horizon distance at last scattering is governed by the relevant physical 
densities, ujm and Wf,; if and u)h are given, the shape of the spatial power spectrum is 
determined. The translation of this into an angular spectrum depends on the angular- 
diameter distance, which is a function of these parameters, plus iOk and lo^. Models 

1/2 

in which RoShir) is a constant have the same angular horizon size. For fixed 
and Lo-h^ there is therefore a degeneracy between curvature (iOk) and vacuum (ys^): these 
two parameters can be varied simultaneously to keep the same apparent distance, as 
illustrated in figure 19. 

In short, this degeneracy occurs because the physical densities control the structure of 
the perturbations in physical Mpc at last scattering, while curvature, fly and flm govern 
the proportionality between length at last scattering and observed angle. The degeneracy 
is not exact, and is weakly broken by the Integrated Sachs- Wolfe effect from evolving 
potentials at very low multipoles, and second-order effects at high I. However, strong 
breaking of the degeneracy requires additional information. This could be in the form of 
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Figure 19. The geometrical degeneracy in the CMB means that models with fixed flmh^ 
and flbh^ can be made to look identical by varying the curvature against vacuum energy, 
while also varying the Hubble parameter. This degeneracy is illustrated here for the case 
LOm = 0,mh^ = 0.2. Models along a given line are equivalent from a CMB point of view; 
corresponding lines in the upper and lower panels have the same line style. Note that supplying 
external information about h breaks the degeneracy. This figure assumes scalar fiuctuations 
only; allowing tensor modes introduces additional degeneracies - mainly between the tensor 
fraction and tilt. 



external data on the Hubble constant, which obeys the relation 

/l^ = U!„l +Uly +U}k, (151) 

so specifying h in addition to the physical matter density fixes cuy + and removes the 
degeneracy. A more elegant approach is to add results from large-scale structure, so that 
conclusions are based only on the shapes of power spectra. Efstathiou et al. (2002) show 
that doing this yields a total density (|ri — 1| < 0.05) at 95% confidence. 

8.2. Horizon- angle degeneracy 

As we have seen, the geometrical degeneracy can be broken either by additional infor- 
mation (such as a limit on h), or by invoking a theoretical prejudice in favour of flatness. 
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Even for flat models, however, there stiU exists a version of the same degeneracy. What 
determines the CMB peak locations for flat models? The horizon size at last scatter- 
ing is = 184 (f2m/i^)~^''^Mpc. The angular scale of these peaks depends on the 
ratio between the horizon size at last scattering and the present-day horizon size for flat 
models: 

Dh = 6000 0-°-^ /i-^Mpc => = -DhV^h oc (152) 

(using the approximation of Vittorio & Silk 1985). This yields an angle scaling as 

so that the scale of the acoustic peaks is apparently almost independent of the main 

parameters. 

However, this argument is incomplete because the earlier expression for Dh{zi,s) as- 
sumes that the universe is completely matter dominated at last scattering, and this is 
not perfectly true. The comoving sound horizon size at last scattering is defined by (e.g. 
Hu & Sugiyama 1995) 

Ds{zi^s) = 775/ 7 — , \,/2 da (153) 

HonU Jo (a + acq) 

where vacuum energy is neglected at these high redshifts; the expansion factor a = 
(1 + and Ols I Ocq are the values at last scattering and matter-radiation equality 
respectively. In practice, Zls — 1100 independent of the matter and baryon densities, 
and Cs is fixed by Thus the main effect is that Oeq depends on fi^. Dividing by 
Dh{z = 0) therefore gives the angle subtended today by the light horizon as 



(154) 



OlS V '^LS 

where = 1100 and Ooq = (23900 o'm)"'^. This remarkably simple result captures 
most of the parameter dependence of CMB peak locations within flat ACDM models. 
Differentiating this equation near a fiducial Um = 0.147 gives 

Ul + — V'^' = +0-24, (155) 



ain6'H 



= -0.1; ''^'^ 



dlnuJr, 



n 2 



in good agreement with the numerical derivatives in Eq. (A15) of Hu ct al. (2001). 

Thus for moderate variations from a 'fiducial' model, the CMB peak multipole number 
scales approximately as ^peak oc i.e. the condition for constant CMB peak 

location is well approximated as 

^mh^''^ = constant. (156) 

However, information about the peak heights does alter this degeneracy slightly; the 
relative peak heights are preserved at constant Qm, hence the actual likelihood ridge is 
a 'compromise' between constant peak location (constant ^Irnh^''^) and constant relative 
heights (constant fi,,, h"^ ) ; the peak locations have more weight in this compromise, leading 
to a likelihood ridge along approximately flmh^''^ — const (Percival et al. 2002). It is now 
clear how LSS data combines with the CMB: 0„i/i'''^ is measured to very high accuracy 
already, and Percival et al. deduced Qmh^''^ = 0.078 with an error of about 6% using 
pre-WMAP CMB data. The first-year WMAP results in fact prefer O^/i^-^ = 0.084 
(Spergel et al. 2003); the slight increase arises because WMAP indicates that previous 
datasets around the peak were on average calibrated low. 

8.3. Tensor degeneracy 

All of the above applies to models in which scalar modes dominate. The possibility of a 
large tensor component yields additional degeneracies, as shown in figure 20. An n = 1 
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Figure 20. The tensor degeneracy. Adding a large tensor component to an n = 1 scalar model 
(solid line) greatly lowers the peak (dashed line), once COBE normalization is imposed. Tilting 
to n = 1.3 cures this (dot-dashed line), but the 2nd and subsequent harmonics are too high. 
Raising the baryon density by a factor 1.5 (dotted line) leaves us approximately back where we 
started. 



model with a large tensor component can be made to resemble a zero-tensor model with 
large blue tilt (n > 1) and high baryon content. Efstathiou et al. (2002) show that 
adding LSS data does not remove this degeneracy; this is reasonable, since LSS data 
only constrain the baryon content weakly. A better way of limiting the possible tensor 
contribution is to look at the amplitude of mass fluctuations today: this normalization 
of the scalar component is naturally lower if the CMB signal is dominated by tensors. 
These issues are discussed further below. 

Another way in which the remaining degeneracy may be lifted is through polariza- 
tion of the CMB fluctuations. A nonzero polarization is inevitable because the electrons 
at last scattering experience an anisotropic radiation field. Thomson scattering from an 
anisotropic source will yield polarization, and the practical size of the fractional polariza- 
tion P is of the order of the quadrupole radiation anisotropy at last scattering: P > 1%. 
This signal is expected to peak at £ ~ 500, and the effect was first seen by the DASI 
experiment (Kovac et al. 2002). Much more detailed polarization results were presented 
by the WMAP satellite, including a critical detection of large-scale polarization arising 
from secondary scattering at low z, thus measuring the optical depth to last scattering 
(Kogut et al. 2003). On large scales, the polarization signature of tensor perturbations 
differs from that of scalar perturbations (e.g. Seljak 1997; Hu & White 1997); the differ- 
ent contributions to the total unpolarized Ci can in principle be disentangled, allowing 
the inflationary test to be carried out. 
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9. Combination of the CMB and large-scale structure 

The 2dFGRS power spectrum contains important information about the key parame- 
ters of the cosmological model, but we have seen that additional assumptions are needed, 
in particular the values of n and h. Observations of CMB anisotropics can in principle 
measure most of the cosmological parameters, and combination with the 2dFGRS can 
lift most of the degeneracies inherent in the CMB-only analysis. It is therefore of interest 
to sec what emerges from a joint analysis. 

The clearest immediate result is that the geometrical degeneracy becomes broken (Ef- 
stathiou et al. 2002). A 95% confidence upper limit on any curvature can be set at 
|ri — 1| < 0.05. Wc can therefore be confident that the imivcrsc is very nearly flat so it is 
defensible to assume hereafter that this is exactly true. The importance of tensors will 
of course be one of the key questions for cosmology over the next several years, but it is 
interesting to consider the limit in which these arc negligible. In this case, the standard 
model for structure formation contains a vector of only 6 parameters: 

= {ns,Q.m,Q.b,h,Q,T). (157) 

Of these, the optical depth to last scattering, r, is almost entirely degenerate with the 
normalization, Q - and indeed with the bias parameter: we discuss this below. The 
remaining four parameters are pinned down very precisely: using a compilation of pre- 
WMAP CMB data plus the 2dFRGS power spectrum, Percival et al. (2002) obtained 

{ris, Wc, W6, h) = (0.963 ± 0.042, 0.115 ± 0.009, 0.021 ± 0.002, 0.665 ± 0.047), (158) 

or an overall density parameter of Q.m = 0.313 ± 0.055. 

It is remarkable how well these figures agree with completely independent determina- 
tions: h = 0.72 ± 0.08 from the HST key project (Mould et al. 2000; Freedman et al. 
2001); Q-bh? = 0.020 ± 0.001 (Buries et al. 2001). This gives confidence that the tensor 
component must indeed be sub-dominant. 

This analysis was published in Percival et al. (2002), and is based on the preliminary 
version of the 2dFGRS power spectrum, from Percival et al. (2001). We can make a first 
estimate of how this is likely to change using the Vlmh = 0.18 ±0.02 from the preliminary 
analysis of P{k) from the final dataset. In combination with the WMAP 0,^^'^ '^ = 0.084 
from the CMB peak degeneracy, this yields 

dm = 0.25 ± 15% h = 0.73 ± 5% (159) 

as the preferred current figures from an analysis of this type. The matter density remains 
frustratingly imprecise, and it is clear that it will be very hard to measure h accurately 
enough to cure this problem. However, complementary constraints on flrn exist at similar 
precision (e.g. fi™ = 0.28 ± 18% for a flat model from the SNe la Hubble diagram; Tonry 
et al. 2003). With new results from gravitational lensing, fi^ should be measured to 
better than 10% precision within a year. 

Perhaps the most striking conclusion from these results concerns the nature of the 
primordial fluctuations, which remain consistent with the n = 1 scale-invariant form. 
The WMAP analysis of Spergcl et al. (2003) yields 0.97 ± 0.03 from CMB plus 2dFGRS 
(cf. 0.96 ± 0.04 from Percival et al. 2002). The WMAP team also consider adding data 
from the Lyman-a forest, which pushes the solution away from a pure power-law: 

n = 0.93 ±0.03 = -0.031 ± 0.016. (160) 

dmk 

This evidence for running of n is at best marginal, and disappears completely when sys- 
tematic uncertainties in the Lyman-a data are considered (Seljak, McDonald & Makarov 
2003). It would in any case be surprising if true, since simple inflation models suggest 
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that dn/dlnk should be second order in (n — 1 ) . Although the tensor degeneracy prevents 
any very strong statements, the data are best described by pure scalar fluctuations, and 
Percival et al. (2002) set an upper limit of 0.7 to the tensor-to-scalar ratio. 

The agreement with pure scalar n = 1 is not yet a strong embarrassment for infla- 
tion, but it is starting to bite on some inflationary models. Leach & Liddle (2003) show 
that CMB plus 2dFGRS arc inconsistent with the V = A^** model at just about 95% 
confidence. It is possible to set up inflation models in which tilt and tensors are both neg- 
ligible, but there has been a long-standing hope for more substantial signs of inflationary 
dynamics; if these are not seen soon, it will be a major disappointment. 



The above conclusions were obtained by considering the shapes of the CMB and galaxy 

power spectra. However, it is also of great interest to consider the amplitude of mass 
fluctuations, since a comparison with the galaxy power spectrum allows us to infer the 
degree of bias directly. This analysis was performed by Lahav et al. (2002). Given 
assumed values for the cosmological parameters, the prcscnt-day linear normalization of 
the mass spectrum (e.g. erg) can be inferred. It is convenient to define a corresponding 
measure for the galaxies, agg, such that we can express the bias parameter as 



In practice, we define CTgg to be the value required to fit a CDM model to the power- 
spectrum data on linear scales (0.02 < k < 0.15 /iMpc~^). The amplitude of 2dFGRS 
galaxies in real space estimated by Lahav ct al. (2002) is (t||(_L*) = 0.76, with a negligibly 
small random error. This assumes no evolution in asg, plus the luminosity dependence 
of clustering measured by Norberg et al. (2001). 

The value of as for the dark matter can be deduced from the CMB fits. Percival et al. 
(2002) obtain 



where the error bar includes both data errors and theory uncertainty. The WMAP 
number here is almost identical: as exp(— r) = 0.71, but no error is quoted (Spcrgel et 
al. 2003). The unsatisfactory feature is the degeneracy with the optical depth to last 
scattering. For reionization at redshift 8, we would have r ~ 0.05; it is not expected 
theoretically that r can be hugely larger, and popular models would place reionization 
between z = 10 and z = 15, or t ~ 0.1 (e.g. Loeb & Barkana 2001). One of the many 
impressive aspects of the WMAP results is that they are able to infer r = 0.17±0.04 from 
large-scale polarization. Taken at face value, r = 0.17 would argue for reionization at 
z = 20, but the error means that more conventional figures are far from being ruled out. 
Taking all this together, it seems reasonable to assume that the true value of as is within 
a few % of 0.80. Given the 2dFGRS figure of (t|| = 0.76, this implies that L* galaxies 
are very nearly exactly unbiased. Since there are substantial variations in the clustering 
amplitude with galaxy type, this outcome must be something of a coincidence. This 
conclusion of near-unity bias was reinforced in a completely independent way, by using 
the measurements of the bispcctrum of galaxies in the 2dFGRS (Verde et al. 2002) . As it 
is based on three-point correlations, this statistic is sensitive to the filamentary nature of 
the galaxy distribution - which is a signature of nonlinear evolution. One can therefore 
split the degeneracy between the amplitude of dark-matter fluctuations and the amount 
of bias. 

These conclusions point the way towards a possible limit on the tensor contribution: a 



9.1. Matter fluctuation amplitude and bias 




(161) 



cr8Cxp(-T) = 0.72 ±0.04, 



(162) 
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large contribution of tensors to the COBE signal would lower the required scalar ampli- 
tude. As an extreme example, a scalar-to-tensor ratio of 1 would reduce the 'apparent' 
cTg by roughly a factor of \/2. to 0.5. Even for an implausibly large value of t, this would 
be hard to reconcile with the level of galaxy clustering plus the requirement of a low 
degree of bias. Also, more direct limits on erg are now being derived from large-scale 
gravitational lensing surveys, with erg ~ 0.7 to 0.8 being favoured (e.g. Brown et al. 
2003; Jarvis et al. 2003). 



10. Less-standard ingredients 

10.1. Limits to the neutrino mass 

Even though a CDM-dominated universe matches the data very well, there are many 
plausible variations to consider. Probably the most interesting is the neutrino mass: 
experimental data on neutrino oscillations mean that at least one neutrino must have a 
mass of > 0.05 eV, so that > 10~^ - the same order of magnitude as stellar mass. 
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Figure 21. Results from Elgaroy et al. (2002), who considered constraints on the neutrino 
mass from 2dFGRS. The first panel shows Power spectra for = (solid line), Qi^ = 0.01 
(dashed line), and 0,^ = 0.05 (dot-dashed line) with amplitudes fitted to the 2dFGRS power 
spectrum data (vertical bars). Other parameters are fixed at f2m = 0.3, = 0.7, h = 0.7, 
ilhh^ = 0.02. The vertical dashed lines limit the range in k used in the fits. The second panel 
shows 68% (solid line), 95% (dashed line) and 99% (dotted line) confidence contours in the plane 
of fi, = Hiy/Qm and F = flmh, with marginalization over h and Qhh'^ using Gaussian priors. 



As explained in earlier, a non-zero neutrino mass can lead to relatively enhanced large- 
scale power, beyond the neutrino free-streaming scale. This is illustrated in figure 21, 
taken from Elgaroy et al. (2002). Broadly speaking, allowing a significant neutrino mass 
changes the spectrum in a way that resembles lower density, so there is a near-degeneracy 
between neutrino mass fraction and fl^h (figure 21). A limit on the neutrino fraction 
thus requires a prior on ^mh. Based on the cluster baryon fraction plus BBN, Elgaroy et 
al. adopt ilm < 0.5; together with the HST Hubble constant, this yields a marginalized 
95% limit of fv < 0.13, or rriv < 1.8 eV. Note that this is the sum of the eigenvalues 
of the mass matrix: given neutrino oscillation results (e.g. Ahmad et al. 2002; Eguchi 
et al. 2003), the only way a cosmologically significant density can arise is via a nearly 
degenerate hierarchy, so this allows us to deduce rriv < 0.6 eV for any one species. 
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Including the latest WMAP results in order to set a more strict limit on ilmh, this limit 
falls to 0.23 eV (Spergel et al. 2003). 

10.2. The equation of state of the vacuum 

So far, we have assumed that the vacuum energy is exactly a classical A, or at any rate 
indistinguishable from one. This is a highly reasonable prior: there is no reason for the 
asymptotic value of any potential to go exactly to zero, so one always needs to solve 
the classical cosmological constant problem - for which probably the only reasonable 
explanation is an anthropic one (e.g. Vilenkin 2001). Therefore, dynamical provision of 
w = Pv/ Pv 7^ —1 is not needed. Nevertheless, one can readily take an empirical approach 
to w (treated as a constant for a first approach). 




Figure 22. The fim/i'' '' degeneracy for flat models gives an almost exact value of Qrn from 
the CMB is h is known, assuming the vacuum to be effectively a classical A {w = —1). If to is 
allowed to vary, this becomes a locus on the {Qm , w) plane (similar to the locus for best-fitting 
flat models from the SNe, showed dotted). Solid circles show values of ^Imh that satisfy the 
updated 2dFGRS constraint of 0.18 (suppressing error bars). 

Figure 22 shows a simplified approach to this, plotting the locus on {w, flm) space 
that is required for a given value of h if the location of the main CMB acoustic peak 
is known exactly. For h ~ 0.7, this is very similar to the locus derived from the SN 
Hubble diagram (Garnavich et al. 1998). The solid circles show the updated 2dFGRS 
constraint of ilmh = 0.18. In order to match the data with w closer to zero, flm must 
increase and h must decrease. The latter trend means that the HST Hubble constant 
sets an upper limit to w of about —0.54 (Percival et al. 2002). This is very similar to 
the SNe constraint of Garnavich et al. (1998), so the combined limit is already close to 
w < —0.8. The vacuum energy is indeed looking rather similar to A. 
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10.3. The total relativistic density 

Finally, an interesting aspect of figure 22 is that it reminds us of history. When the 
COBE detection was announced in 1992, a popular model was 'standard' CDM with 
flm = 1, h ^ 0.5. As wc sec, this comes close to fitting the CMB data, and such 
a model is not unattractive in some ways. Can we be sure it is ruled out? Leaving 
aside the SNe data, one might think to evade the 2dFGRS constraint by altering the 
total relativistic content of the universe (for example, by the decay of a heavy neutrino 
after nucleosynthesis). Since 2dFGRS measures the horizon at matter-radiation equality, 
this will be changed. If the radiation density is arbitrarily boosted by a factor X, the 
constraint from LSS becomes 

(^m^)apparent — X ^ (Q^/?.)true* (-^^*^) 

Therefore X ~ 8 is required to allow an Einstein-de Sitter universe. 

However, this argument fails, because it does not take into account the effect of the 
extra radiation on the CMB. As argued above, the location of the acoustic peaks depends 
on Acq, which depends on cUm- However, if we change the radiation content, then what 
matters is uJm/X. Thus, the CMB peak constraint now reads 

f2-°-i(w™/X)°-24 = constant; (164) 

when combining LSS and CMB, everything is as before except that the effective Hubble 
parameter is hjX^^"^ . Thus, a model with Q.„i = 1 but boosted radiation would only 
fit the CMB with h ~ 0.5-\/8 ~ 1.4, and the attractiveness of a low age is lost. In any 
case, combining LSS and CMB would give the same Qm — 0.3 independent of X, so it is 
impossible to save models with fl„i = 1 by this route. 

Finally, it is interesting to invert this argument. Since Percival et al. (2002) obtain 
an effective h of 0.665 ± 0.047 and Freedman et al. (2001) measure h = 0.72 ± 0.08, we 
deduce 

1.68X = 1.82 ±0.24. (165) 

This convincingly rules out the 1.68X = 1 that would apply if the universe contained 
only photons, and amounts to a detection of the neutrino background. In terms of the 
number of neutrino species, this is N^, = 3.6 ± 1.1. A more precise result is of course 
obtained from primordial nucleosynthesis, but this applies at a much later epoch, thus 
constraining models with decaying particles. 

11. Conclusions 

The beautiful data on the large-scale structure of the universe revealed in particu- 
lar by the 2dF Galaxy Redshift Survey combine with the incredible recent progrc;ss in 
CMB data to show spectacularly good agreement with a 'standard model' for structure 
formation. This consists of a scalar-mode adiabatic CDM universe with scale-invariant 
fluctuations. Measuring the exact parameters of this model is rendered difficult by the 
intrinsic degeneracies of the structure-formation process, but progress is being made. 
The most recent data yield flm = 0.25 ± 15% and h = 0.73 ± 5%; these figures accord 
well with independent constraints, and it is very hard to believe that they are incorrect. 

Allowing extra degrees of freedom, such as massive neutrinos, vacuum equation of state 
m; ^ 1, or extra relativistic content worsens the agreement with independent constraints 
on h and flm- This both supports the simplest picture and allows us to set interesting 
limits on these non-standard ingredients. 

For the future, we can look with anticipation to meaningful tests of inflation: the 
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current data are consistent with n = 1 to an error of ±0.03, and the errors may be 
expected to halve over the next couple of years, bringing plausible levels of tilt well 
within the reach of experimental detection. A demonstration that n 1 would be a 
large step in the direction of proving inflation to be true, so the cosmological stakes 
over the next few years will be high. The tensor fraction is a less clear target, but the 
motivation to improve on the current weak upper limits will remain strong. 

It should of course not be forgotten that the large-scale structure we measure locally 
consists of galaxies. In these lectures, the physics of galaxy formation has been largely 
ignored, but this will be the increasing focus of LSS studies: not just the global pa- 
rameters of the universe, but the detailed understanding of how the complex structures 
around us formed. 
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